Nicholas Albion
|
bc89f285d8
|
bindings : add java bindings (#931)
* WIP - java bindings
* updated README
* failed attempt at JNI
* fullTranscribe() test passes
* tested on Ubuntu 20
* link to Java bindings
|
2023-05-20 18:25:02 +03:00 |
|
Georgi Gerganov
|
794b162a46
|
whisper : add integer quantization support (#540)
* whisper : add integer quantization support
* examples : add common-ggml + prepare to add "quantize" tool
* whisper : quantization tool ready
* whisper : fix F32 support
* whisper : try to fix shared lib linkage
* wasm : update quantized models to Q5
* bench.wasm : remove "medium" button
* bench.wasm : fix custom model button
* ggml : add Q5_0 and Q5_1 WASM SIMD
* wasm : add quantized models to all WASM examples
* wasm : bump DB version number to 2
* talk-llama : update example to latest llama.cpp
* node : increase test timeout to 10s
* readme : add information for model quantization
* wasm : add links to other examples
|
2023-04-30 18:51:57 +03:00 |
|
Georgi Gerganov
|
5fd1bdd7fc
|
whisper : add GPU support via cuBLAS (#834)
* make : add WHISPER_CUBLAS
* make : fix CUBLAS build
* whisper : disable Flash Attention + adjust memory buffers
* whisper : remove old commented code
* readme : add cuBLAS instructions
* cmake : add WHISPER_CUBLAS option
* gitignore : ignore build-cublas
|
2023-04-30 12:14:33 +03:00 |
|
Georgi Gerganov
|
5e47e223bd
|
whisper : add Core ML support (#566)
* coreml : use Core ML encoder inference
* coreml : simplify whisper_encode + log messages
* whisper : resolve rebase conflicts
* coreml : add scripts for CoreML model generation
* bench-all : recognize COREML flag
|
2023-04-15 13:21:27 +03:00 |
|
Georgi Gerganov
|
34b772727d
|
gitignore : add .test
|
2023-04-14 20:13:47 +03:00 |
|
Georgi Gerganov
|
4a0deb8b1e
|
talk-llama : add new example + sync ggml from llama.cpp (#664)
* talk-llama : talk with LLaMA AI
* talk-llama : disable EOS token
* talk-llama : add README instructions
* ggml : fix build in debug
|
2023-03-27 21:00:32 +03:00 |
|
Georgi Gerganov
|
ad1389003d
|
release : v1.2.1
|
2023-02-28 22:29:12 +02:00 |
|
Lukas Rist
|
02c7516c57
|
go : added wrappers to reset and print timings (#436)
|
2023-01-25 18:57:30 +02:00 |
|
Georgi Gerganov
|
9a65269a20
|
.gitignore : add arm_neon.h
|
2023-01-23 20:19:04 +02:00 |
|
Georgi Gerganov
|
8de452c18b
|
Improve decoding (#291)
* whisper : prepare infra for new decoding strategies
* whisper : apply logit filters and compute logprobs
* whisper : add whisper_get_logits()
* whisper : separate self and cross attention memory
Initial step needed for supporting parallel decoders
* whisper : move probs_id buffer to whisper_context
* whisper : refactor kv cache into separate struct
* whisper : move self-attention kv cache to whisper_decoder
* whisper : wip decoding parameters + strategies
* whisper : wip decoding parameters + strategies (part 2)
* whisper : wip decoding parameters + strategies (part 3)
* whisper : wip decoding parameters + strategies (part 4)
* whisper : fix prompt_past update to not include prompt_init
* whisper : temperature + best_of support
* whisper : support for compression_ratio_threshold
We actually use entropy, but it is similar
* command : fix example to use logits instead of obsolete probs
* whisper : handle empty sequence ranking
* whisper : add WHISPER_DEBUG + diagnostic prints + new main args
* whisper : minor fixes
* whisper : add beam-search support
* whisper : bug fix when there is no previous context
* whisper : add comments
* stream : disable temperature fallback
For real-time processing, we always want a single decoder running at T=0
* whisper.swiftui : update example - fix paths + add empty folders
|
2023-01-15 11:29:57 +02:00 |
|
Georgi Gerganov
|
054940e1f6
|
minor : fix .gitignore to not ignore examples
|
2022-12-11 11:39:46 +02:00 |
|
Georgi Gerganov
|
3b1aacbe6d
|
talk : talk with AI in the terminal
|
2022-12-10 16:51:58 +02:00 |
|
Georgi Gerganov
|
832b4f34c9
|
make : indentation + .gitignore
|
2022-12-08 19:42:06 +02:00 |
|
Georgi Gerganov
|
bc88eb13c6
|
examples : add "command" tool (#171)
|
2022-11-25 19:36:57 +02:00 |
|
Georgi Gerganov
|
b8ce25dec1
|
refactoring : more readable code
|
2022-11-25 19:28:04 +02:00 |
|
Georgi Gerganov
|
c6710efde2
|
refactoring : move main + stream in examples + other stuff
|
2022-10-25 20:53:48 +03:00 |
|
Georgi Gerganov
|
bb1ee266d2
|
ios : whisper.objc example
|
2022-10-24 18:23:07 +03:00 |
|
Georgi Gerganov
|
d6b84b2a23
|
ref #62 : fix build for some compilers
For some reason, new versions of GCC panic when the struct type is not
specified explicitly
|
2022-10-18 10:57:03 +03:00 |
|
Georgi Gerganov
|
b4a3875b2c
|
Revert recent sampling change
It does not actually help and seems to produce worse results on some of
the samples
|
2022-10-18 08:26:16 +03:00 |
|
Georgi Gerganov
|
0e858f080d
|
close #56 : build on FreeBSD
Thanks to @abelbabel for the contribution
|
2022-10-17 18:10:16 +03:00 |
|
Borislav Stanimirov
|
28252352d7
|
Visual Studio ignored dirs
|
2022-10-11 20:57:33 +03:00 |
|
Georgi Gerganov
|
2f069335ab
|
Adding sanitizer tests
|
2022-10-08 11:43:42 +03:00 |
|
Georgi Gerganov
|
877c058179
|
Add CMake support
|
2022-10-08 09:02:41 +03:00 |
|
Georgi Gerganov
|
b6bf906730
|
ref #10 : quick-and-dirty attempt for real-time audio transcription
- Processes input in chunks of 3 seconds.
- Padding audio with silence
- Uses 1 second audio from previous pass
- No text context
|
2022-10-02 17:55:45 +03:00 |
|
Georgi Gerganov
|
b0a11594ae
|
Initial release
|
2022-09-25 22:13:49 +03:00 |
|