whisper.cpp/ggml
Latest commit: 15d71189e9 by Johannes Gäßler, 2024-08-08 22:48:46 +03:00
CUDA: optimize and refactor MMQ (llama/8416)

* CUDA: optimize and refactor MMQ
* explicit q8_1 memory layouts, add documentation
Name                          Last commit                                                          Last commit date
cmake                         whisper : reorganize source code + improve CMake (#2256)             2024-06-26 19:34:09 +03:00
include                       ggml : add AArch64 optimized GEMV and GEMM Q4 kernels (llama/5780)   2024-08-08 22:48:46 +03:00
src                           CUDA: optimize and refactor MMQ (llama/8416)                         2024-08-08 22:48:46 +03:00
.gitignore                    whisper : reorganize source code + improve CMake (#2256)             2024-06-26 19:34:09 +03:00
CMakeLists.txt                ggml : move sgemm sources to llamafile subfolder (llama/8394)        2024-08-08 22:48:46 +03:00
ggml_vk_generate_shaders.py   whisper : reorganize source code + improve CMake (#2256)             2024-06-26 19:34:09 +03:00
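
The latest commit above mentions "explicit q8_1 memory layouts" for the MMQ (matrix-multiplication with quantized operands) kernels. For orientation only, the following is a minimal C sketch of a q8_1 block under the usual ggml conventions (an fp16 scale, an fp16 precomputed sum, and 32 signed 8-bit quants); the field names, the uint16_t encoding of fp16, and the omission of ggml's union wrapper are simplifying assumptions, not the code of this commit (that lives under src/).

/*
 * Sketch of a q8_1 quantization block, assuming the conventional
 * ggml layout. Illustrative only; see src/ for the actual definitions.
 */
#include <stdint.h>

#define QK8_1 32                 /* number of weights per q8_1 block */

typedef struct {
    uint16_t d;                  /* fp16 scale (delta) for the block */
    uint16_t s;                  /* fp16 value of d * sum(qs[i]), cached so
                                    dot-product kernels can fold in the
                                    offset term without re-summing qs */
    int8_t   qs[QK8_1];          /* 32 signed 8-bit quantized values */
} block_q8_1;

Keeping d and s adjacent lets a kernel load both scales with a single 32-bit read before consuming the quants, which is the kind of layout detail the commit message is calling out as now being explicit and documented.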