Georgi Gerganov
|
8de452c18b
|
Improve decoding (#291)
* whisper : prepare infra for new decoding strategies
* whisper : apply logit filters and compute logprobs
* whisper : add whisper_get_logits()
* whisper : separate self and cross attention memory
Initial step needed for supporting parallel decoders
* whisper : move probs_id buffer to whisper_context
* whisper : refactor kv cache into separate struct
* whisper : move self-attention kv cache to whisper_decoder
* whisper : wip decoding parameters + strategies
* whisper : wip decoding parameters + strategies (part 2)
* whisper : wip decoding parameters + strategies (part 3)
* whisper : wip decoding parameters + strategies (part 4)
* whisper : fix prompt_past update to not include prompt_init
* whisper : temperature + best_of support
* whisper : support for compression_ration_threshold
We actually use entropy, but it is similar
* command : fix example to use logits instead of obsolete probs
* whisper : handle empty sequence ranking
* whisper : add WHISPER_DEBUG + diagnostic prints + new main args
* whisper : minor fixes
* whisper : add beam-search support
* whisper : bug fix when there no previous context
* whisper : add comments
* stream : disable temperature fallback
For real-time processing, we always want a single decoder running at T=0
* whisper.swiftui : update example - fix paths + add empty folders
|
2023-01-15 11:29:57 +02:00 |
|
Georgi Gerganov
|
054940e1f6
|
minor : fix .gitignore to not ignore examples
|
2022-12-11 11:39:46 +02:00 |
|
Georgi Gerganov
|
3b1aacbe6d
|
talk : talk with AI in the terminal
|
2022-12-10 16:51:58 +02:00 |
|
Georgi Gerganov
|
832b4f34c9
|
make : indentation + .gitignore
|
2022-12-08 19:42:06 +02:00 |
|
Georgi Gerganov
|
bc88eb13c6
|
examples : add "command" tool (#171)
|
2022-11-25 19:36:57 +02:00 |
|
Georgi Gerganov
|
b8ce25dec1
|
refactoring : more readable code
|
2022-11-25 19:28:04 +02:00 |
|
Georgi Gerganov
|
c6710efde2
|
refactoring : move main + stream in examples + other stuff
|
2022-10-25 20:53:48 +03:00 |
|
Georgi Gerganov
|
bb1ee266d2
|
ios : whisper.objc example
|
2022-10-24 18:23:07 +03:00 |
|
Georgi Gerganov
|
d6b84b2a23
|
ref #62 : fix build for some compilers
For some reason, new version of GCC panic when the struct type is not
specified explicitly
|
2022-10-18 10:57:03 +03:00 |
|
Georgi Gerganov
|
b4a3875b2c
|
Revert recent sampling change
It does not actually help and seems to produce worse results on some of
the samples
|
2022-10-18 08:26:16 +03:00 |
|
Georgi Gerganov
|
0e858f080d
|
close #56 : build on FreeBSD
Thanks to @abelbabel for the contribution
|
2022-10-17 18:10:16 +03:00 |
|
Borislav Stanimirov
|
28252352d7
|
Visual Studio ignored dirs
|
2022-10-11 20:57:33 +03:00 |
|
Georgi Gerganov
|
2f069335ab
|
Adding sanitizer tests
|
2022-10-08 11:43:42 +03:00 |
|
Georgi Gerganov
|
877c058179
|
Add CMake support
|
2022-10-08 09:02:41 +03:00 |
|
Georgi Gerganov
|
b6bf906730
|
ref #10 : quick-and-dirty attempt for real-time audio transciption
- Processes input in chunks of 3 seconds.
- Padding audio with silence
- Uses 1 second audio from previous pass
- No text context
|
2022-10-02 17:55:45 +03:00 |
|
Georgi Gerganov
|
b0a11594ae
|
Initial release
|
2022-09-25 22:13:49 +03:00 |
|