Georgi Gerganov
|
45ddda8e0c
|
ggml : drop support for QK_K=64 (llama/7473)
* ggml : drop support for QK_K=64
ggml-ci
* opencl : restore QK_K=256 define
|
2024-06-16 18:19:48 +03:00 |
|
Meng, Hengyu
|
f12e982c0b
|
Disable iqx on windows as WA (llama/6435)
* disable iqx on windows as WA
* array instead of global_memory
|
2024-04-07 16:15:57 +03:00 |
|
Georgi Gerganov
|
2948c740a2
|
sync : ggml (#2001)
* sync : update scripts
* sync : ggml
* talk-llama : sync llama.cpp
* make : WHISPER_CUBLAS -> WHISPER_CUDA
* ci : try to fix sycl build
* talk-llama : fix make build
|
2024-03-27 18:55:10 +02:00 |
|
Georgi Gerganov
|
3753a2b2a8
|
ggml : add ggml-common.h
|
2024-03-15 14:01:14 +02:00 |
|