whisper.cpp/examples
Georgi Gerganov b0502836b8
whisper : add full CUDA and Metal offloading (#1472)
* whisper : migrate to ggml-backend

* whisper : fix logit reading

* whisper : fix tensor allocation during load

* whisper : fix beam-search with CUDA

* whisper : free backends + fix compile warning

* whisper : print when CUDA is enabled

* whisper : fix CoreML

* make : clean-up

* talk : fix compile warning

* whisper : support ggml_conv with CUDA and Metal (#1473)

* ggml : add CUDA support for ggml_conv

* whisper : remove ggml_repeat for conv bias + single backend

* cuda : fix im2col kernel

* metal : add im2col support + mul mat-vec f16 x f16

* bench-all : add q4 models

* whisper : clean-up

* quantize-all : fix

* ggml : im2col opts

* whisper : avoid whisper_model_data wrapper

* whisper : add note that ggml_mul_mat_pad does not work with CUDA

* whisper : factor out graph compute in common function

* whisper : fixes

* whisper : fix UB with measure buffers

* whisper : try to fix the parallel whisper_state functionality (#1479)

* whisper : try to fix the parallel whisper_state functionality

* whisper : fix multi-state Metal

* whisper : free backend instances in whisper_state
2023-11-12 15:31:08 +02:00
..
addon.node whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
bench whisper : add support for large v3 (#1444) 2023-11-07 15:30:18 +02:00
bench.wasm whisper : add support for large v3 (#1444) 2023-11-07 15:30:18 +02:00
command whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
command.wasm whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
lsp whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
main whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
quantize quantize : fix load vocab crash when len is 128 (#1160) 2023-08-06 11:04:42 +03:00
stream whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
stream.wasm whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
talk whisper : add full CUDA and Metal offloading (#1472) 2023-11-12 15:31:08 +02:00
talk-llama talk-llama : add language auto detect (#1467) 2023-11-09 19:21:44 +02:00
talk.wasm whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
whisper.android android : decouple example into a library and app module (#1445) 2023-11-07 14:27:33 +02:00
whisper.nvim examples : vim plugin and LSP server (#1144) 2023-08-27 21:35:06 +03:00
whisper.objc whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
whisper.swiftui ios : add support for Swift Package Manager (#1370) 2023-11-07 23:53:31 +02:00
whisper.wasm whisper : add context param to disable gpu (#1293) 2023-11-06 11:04:24 +02:00
CMakeLists.txt examples : vim plugin and LSP server (#1144) 2023-08-27 21:35:06 +03:00
common-ggml.cpp ggml : sync latest ggml lib 2023-06-25 14:30:44 +03:00
common-ggml.h whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
common-sdl.cpp examples : refactor in order to reuse code and reduce duplication (#482) 2023-02-15 19:28:10 +02:00
common-sdl.h examples : refactor in order to reuse code and reduce duplication (#482) 2023-02-15 19:28:10 +02:00
common.cpp sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 2023-11-03 21:35:05 +02:00
common.h whisper : add full CUDA and Metal offloading (#1472) 2023-11-12 15:31:08 +02:00
dr_wav.h refactoring : move main + stream in examples + other stuff 2022-10-25 20:53:48 +03:00
generate-karaoke.sh minor : add comment for using "generate_karaoke.sh" 2022-11-26 10:22:42 +02:00
helpers.js whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
livestream.sh whisper : add support for large v3 (#1444) 2023-11-07 15:30:18 +02:00
twitch.sh whisper : add support for large v3 (#1444) 2023-11-07 15:30:18 +02:00
yt-wsp.sh yt-wsp.sh : print help on empty args 2023-02-18 09:42:31 +02:00