Georgi Gerganov
|
d9efb664ac
|
sync : ggml
|
2024-11-01 10:19:05 +02:00 |
|
Georgi Gerganov
|
4e10afb5a9
|
scripts : sync amx
|
2024-10-31 22:13:24 +02:00 |
|
Georgi Gerganov
|
55e422109b
|
scripts : add turbo-q8_0 to the benchmark
|
2024-10-29 19:37:24 +02:00 |
|
Georgi Gerganov
|
8a35b58c4f
|
scripts : bench v3-turbo
|
2024-10-05 16:22:53 +03:00 |
|
Georgi Gerganov
|
941912467d
|
whisper : adapt to latest ggml (skip) (#0)
|
2024-10-05 15:23:51 +03:00 |
|
Georgi Gerganov
|
f7d55e0614
|
scripts : sync ggml-backend.cpp
|
2024-10-05 15:23:51 +03:00 |
|
Georgi Gerganov
|
ff2cb0811f
|
sync : ggml
|
2024-10-03 12:22:17 +03:00 |
|
Georgi Gerganov
|
2ef717b293
|
whisper : add large-v3-turbo (#2440)
|
2024-10-01 15:57:06 +03:00 |
|
Georgi Gerganov
|
1133ac98a8
|
ggml : add ggml-cpu-impl.h (skip) (#0)
|
2024-09-24 19:45:08 +03:00 |
|
Georgi Gerganov
|
76d27eec9a
|
sync : ggml
|
2024-09-24 19:45:08 +03:00 |
|
Georgi Gerganov
|
2abaf19e0d
|
sync : ggml
|
2024-09-02 15:24:50 +03:00 |
|
Georgi Gerganov
|
8cc90a0e80
|
sync : ggml
|
2024-08-28 13:22:20 +03:00 |
|
Georgi Gerganov
|
9e3c5345cd
|
sync : ggml vulkan (ggml/0)
ggml-ci
|
2024-08-21 11:07:13 +03:00 |
|
Georgi Gerganov
|
22fcd5fd11
|
sync : ggml
|
2024-08-12 11:59:15 +03:00 |
|
Georgi Gerganov
|
4b9c4de1ad
|
sync : ggml
|
2024-08-09 09:58:16 +03:00 |
|
Georgi Gerganov
|
3ab19c744e
|
scripts : sync cann
|
2024-08-09 09:58:16 +03:00 |
|
Georgi Gerganov
|
22058f2dbc
|
talk-llama : sync llama.cpp
|
2024-08-08 22:48:46 +03:00 |
|
Georgi Gerganov
|
5b7979a1e6
|
sync : ggml
|
2024-08-08 22:48:46 +03:00 |
|
Georgi Gerganov
|
701265bf38
|
scripts : sync new files (#0)
|
2024-08-08 22:48:46 +03:00 |
|
Georgi Gerganov
|
425e2910a3
|
sync : ggml
|
2024-07-08 14:53:55 +03:00 |
|
Georgi Gerganov
|
ff08e30ab5
|
scripts : fix sync scripts
|
2024-07-08 14:53:55 +03:00 |
|
Georgi Gerganov
|
c118733a29
|
sync : ggml + fix sync script
|
2024-06-26 23:20:19 +03:00 |
|
Georgi Gerganov
|
dc8cc2dd6f
|
whisper : disable CUDA mel + fix FFMPEG
|
2024-06-26 20:11:38 +03:00 |
|
Georgi Gerganov
|
3efedb9511
|
sync : ggml
|
2024-06-26 19:40:23 +03:00 |
|
Georgi Gerganov
|
e30c679928
|
whisper : reorganize source code + improve CMake (#2256)
* scripts : update sync [no ci]
* files : reorganize [no ci]
* sync : llama.cpp
* cmake : link math library
* cmake : build normal ggml library
* files : move headers to include
* objc : fix path to ggml-metal.h
* ci : fix WHISPER_CUDA -> GGML_CUDA
* scripts : sync LICENSE [no ci]
|
2024-06-26 19:34:09 +03:00 |
|
Georgi Gerganov
|
54d5823ebe
|
scripts : sync ggml-blas
|
2024-06-18 09:39:40 +03:00 |
|
Georgi Gerganov
|
4a6e6e8b30
|
sync : ggml
|
2024-06-18 09:39:40 +03:00 |
|
Georgi Gerganov
|
63a767a134
|
scripts : stop sync whisper example from ggml
|
2024-06-18 09:39:40 +03:00 |
|
Georgi Gerganov
|
3b1ac03828
|
ggml : remove OpenCL (#0)
|
2024-06-16 18:19:48 +03:00 |
|
Georgi Gerganov
|
3c7cc5c437
|
sync : ggml
ggml-ci
|
2024-06-16 18:19:48 +03:00 |
|
Georgi Gerganov
|
5b7073cae1
|
scripts : update sync
|
2024-06-16 12:41:42 +03:00 |
|
Georgi Gerganov
|
7094ea5e75
|
whisper : use flash attention (#2152)
* whisper : use flash attention in the encoder
* whisper : add kv_pad
* whisper : remove extra backend instance (huh?)
* whisper : use FA for cross-attention
* whisper : use FA for self-attention
* whisper : simplify encoder FA
* whisper : add flash_attn runtime parameter
* scripts : add bench log
* scripts : add M1 Pro bench log
|
2024-05-15 09:38:19 +03:00 |
|
Georgi Gerganov
|
f56b8305c4
|
sync : ggml
|
2024-05-14 19:16:32 +03:00 |
|
Georgi Gerganov
|
130f43e4b8
|
scripts : sync ggml-rpc
|
2024-05-14 19:15:35 +03:00 |
|
Georgi Gerganov
|
fe179ae0cc
|
sync : ggml
|
2024-05-13 11:02:26 +03:00 |
|
Georgi Gerganov
|
8f253ef3af
|
sync : ggml
|
2024-04-09 20:27:55 +03:00 |
|
Georgi Gerganov
|
c7dc37f97c
|
license : update copyright notice + add AUTHORS
|
2024-04-09 20:27:44 +03:00 |
|
Georgi Gerganov
|
3b8aade3c2
|
scripts : update sync
|
2024-04-09 20:25:50 +03:00 |
|
Georgi Gerganov
|
52ccd4a3a8
|
files : rename ./extra to ./scripts
|
2024-04-09 20:13:41 +03:00 |
|