Georgi Gerganov
|
80c1512fd5
|
sync : ggml (const correctness)
|
2023-09-15 14:49:56 +03:00 |
|
Georgi Gerganov
|
bfc73f1fa2
|
sync : ggml (CUDA faster rope)
|
2023-09-08 15:01:26 +03:00 |
|
Georgi Gerganov
|
c3f319d7c2
|
ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247)
* ggml : sync latest llama.cpp (view_src + alloc improvements)
* ggml : fix build
|
2023-09-05 20:57:27 +03:00 |
|
Georgi Gerganov
|
59a3d0cb57
|
ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220)
* ggml : sync (ggml-alloc, GPU, eps, etc.)
* ggml : fix build
* wasm : fix build
|
2023-09-05 13:54:40 +03:00 |
|
ardfork
|
cb5fb0a12d
|
whisper : initial hipBLAS support (#1209)
|
2023-08-27 20:03:58 +03:00 |
|
Georgi Gerganov
|
d6509bf78d
|
ggml : sync latest repo (mostly refactoring changes)
|
2023-07-02 21:46:09 +03:00 |
|
Georgi Gerganov
|
5feb0dffba
|
ggml : sync latest ggml lib
|
2023-06-25 14:30:44 +03:00 |
|
Georgi Gerganov
|
e410cfc3ce
|
ggml : sync latest ggml repo
- new Q4 and Q8 quantization
- updated CUDA
|
2023-05-20 18:56:30 +03:00 |
|
Georgi Gerganov
|
e693074aa6
|
ggml : sync latest ggml
- New Q4 and Q5 formats
- Various improvements
|
2023-05-14 18:04:23 +03:00 |
|
Georgi Gerganov
|
0bcb64b184
|
ggml : sync ggml (clBLAST + tensor names)
|
2023-05-02 21:24:18 +03:00 |
|
Georgi Gerganov
|
acec73ab6e
|
ggml : sync latest ggml + llama.cpp updates (quantization)
|
2023-04-29 12:32:28 +03:00 |
|