whisper.cpp/extra
Latest commit b6c5f49b78 by Georgi Gerganov:
whisper : add batched decoding (#1486)
* whisper : add whisper_batch

* whisper : move kv_self to whisper_state

* whisper : full batched decoding support

* whisper : fix memory leak in whisper_batch

* whisper : fix mem leak again + remove obsolete function

* whisper : clear kv cache when using whisper_decode API

* whisper : speed-up sampling

* whisper : fix decoders initializer

* bench : add batch size 5 bench

* whisper : add comment about the KV cache size

* whisper : add check for max number of decoders

* whisper : avoid starting sampling threads with bs=1

* whisper : enable beam-search by default

* cuda : sync llama.cpp fixes
Committed: 2023-11-15 16:12:52 +02:00
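
The first note above, "add whisper_batch", refers to a structure internal to whisper.cpp rather than part of the public whisper.h API. Below is a minimal sketch of the idea, assuming a layout modeled on llama.cpp's llama_batch, which this change syncs with; all names in it are illustrative, not the actual definition. Each queued token carries its position and the id of the decoder (beam) it belongs to, so one decode call can serve several decoders against a shared KV cache:

```cpp
// Illustrative only: a simplified token batch in the spirit of the internal
// whisper_batch. Names and layout are assumptions based on llama.cpp's llama_batch.
#include <cstdint>
#include <cstdio>
#include <vector>

using token_id = int32_t;
using seq_id_t = int32_t;

struct token_batch {
    std::vector<token_id> token;  // one entry per queued token
    std::vector<int32_t>  pos;    // position of the token in its sequence
    std::vector<seq_id_t> seq;    // which decoder (beam) the token belongs to
    std::vector<int8_t>   logits; // whether logits are requested for this token
};

// queue one decoding step for several decoders at the same position `pos`
static token_batch make_step(const std::vector<token_id> & beam_tokens, int32_t pos) {
    token_batch batch;
    for (seq_id_t s = 0; s < (seq_id_t) beam_tokens.size(); ++s) {
        batch.token.push_back(beam_tokens[s]);
        batch.pos.push_back(pos);
        batch.seq.push_back(s);
        batch.logits.push_back(1); // every beam needs logits to keep sampling
    }
    return batch;
}

int main() {
    // 5 beams, each proposing its own next token for the same position
    const token_batch batch = make_step({101, 102, 103, 104, 105}, /*pos =*/ 7);
    for (size_t i = 0; i < batch.token.size(); ++i) {
        printf("seq %d: token %d at pos %d\n", batch.seq[i], batch.token[i], batch.pos[i]);
    }
    return 0;
}
```

Tagging tokens with sequence ids is what lets the batched KV cache (see "move kv_self to whisper_state" above) keep per-beam attention histories separate while evaluating them in a single pass.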
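
Since the commit enables beam-search by default and adds a batch-size-5 bench, here is a hedged usage sketch against the public whisper.h API of this revision; the model path is a placeholder and the audio is stand-in silence:

```cpp
// Hedged sketch: enabling the beam-search decoding path that this commit
// turns on by default. Model path and PCM buffer are placeholders.
#include "whisper.h"

#include <cstdio>
#include <vector>

int main() {
    struct whisper_context * ctx = whisper_init_from_file("models/ggml-base.en.bin");
    if (ctx == nullptr) {
        return 1;
    }

    // beam-search runs several decoders per step; with this change they are
    // evaluated in one batched call instead of one decode per beam
    whisper_full_params params = whisper_full_default_params(WHISPER_SAMPLING_BEAM_SEARCH);
    params.beam_search.beam_size = 5; // matches the new batch-size-5 bench

    // stand-in audio: 1 second of silence at the 16 kHz rate whisper expects
    std::vector<float> pcm(WHISPER_SAMPLE_RATE, 0.0f);

    if (whisper_full(ctx, params, pcm.data(), (int) pcm.size()) == 0) {
        for (int i = 0; i < whisper_full_n_segments(ctx); ++i) {
            printf("%s\n", whisper_full_get_segment_text(ctx, i));
        }
    }

    whisper_free(ctx);
    return 0;
}
```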
File               Last commit                                                  Date
bench-all.sh       whisper : add batched decoding (#1486)                       2023-11-15 16:12:52 +02:00
bench-wts.sh       bench-wts.sh : rename script + add execute permission        2023-03-06 21:02:24 +02:00
bench.py           extra: Add benchmark script implemented in Python (#1298)    2023-09-25 23:45:15 +08:00
convert-all.sh     whisper : add support for large v3 (#1444)                   2023-11-07 15:30:18 +02:00
deploy-wasm.sh     Node.js package (#260)                                       2022-12-12 20:17:27 +02:00
quantize-all.sh    whisper : add full CUDA and Metal offloading (#1472)         2023-11-12 15:31:08 +02:00
sha-all.sh         extra : compute SHA of all models files                      2022-11-02 18:31:55 +02:00
sync-ggml.sh       cuda : fix HIPBLAS build                                     2023-11-05 19:41:15 +02:00