7a2abb311d
grammars : add assistant + update comments
2023-09-09 20:24:58 +03:00
54d168db67
command : grammar-related improvements
...
- option to read grammar from file
- add sample grammars for colors and chess moves
- fine-tune the performance further
2023-09-09 20:05:57 +03:00
b8f34d1ed7
whisper : fine-tuning grammar functionality
2023-09-06 17:05:05 +03:00
97ebb48b99
command : fix exception when recognizing the command
2023-09-06 15:05:17 +03:00
31476ccc0e
whisper : add grammar-based sampling
2023-08-30 20:00:31 -04:00
8ba42095c5
Revert "ggml : do not use _GNU_SOURCE gratuitously ( #1027 )"
...
This reverts commit 3f7a03ebe3
.
2023-07-02 21:53:52 +03:00
3f7a03ebe3
ggml : do not use _GNU_SOURCE gratuitously ( #1027 )
...
* Do not use _GNU_SOURCE gratuitously.
What is needed to build whisper.cpp and examples is availability of
stuff defined in The Open Group Base Specifications Issue 6
(https://pubs.opengroup.org/onlinepubs/009695399/ ) known also as
Single Unix Specification v3 (SUSv3) or POSIX.1-2001 + XSI extensions.
There is no need to penalize musl libc which simply follows standards.
Not having feature test macros in source code gives greater flexibility
to those wanting to reuse it in 3rd party app, as they can build it with
minimal FTM (_XOPEN_SOURCE=600) or other FTM depending on their needs.
It builds without issues in Alpine (musl libc), Ubuntu (glibc), MSYS2.
* examples : include SDL headers before other headers
This is an attempt at fixing macOS build error coming from SDL2 relying
on Darwin extension memset_pattern4/8/16 coming from Apple's string.h.
2023-06-25 16:34:30 +03:00
4a7d49af95
examples : fix + refactor Levenshtein distance
2023-04-30 19:12:49 +03:00
5fd1bdd7fc
whisper : add GPU support via cuBLAS ( #834 )
...
* make : add WHISPER_CUBLAS
* make : fix CUBLAS build
* whisper : disable Flash Attention + adjust memory buffers
* whisper : remove old commented code
* readme : add cuBLAS instructions
* cmake : add WHISPER_CUBLAS option
* gitignore : ignore build-cublas
2023-04-30 12:14:33 +03:00
09d7d2b68e
examples : refactor in order to reuse code and reduce duplication ( #482 )
...
* examples : refactor common code into a library
* examples : refactor common SDL code into a library
* make : update Makefile to use common libs
* common : fix MSVC M_PI ..
* addon.node : link common lib
2023-02-15 19:28:10 +02:00
8de452c18b
Improve decoding ( #291 )
...
* whisper : prepare infra for new decoding strategies
* whisper : apply logit filters and compute logprobs
* whisper : add whisper_get_logits()
* whisper : separate self and cross attention memory
Initial step needed for supporting parallel decoders
* whisper : move probs_id buffer to whisper_context
* whisper : refactor kv cache into separate struct
* whisper : move self-attention kv cache to whisper_decoder
* whisper : wip decoding parameters + strategies
* whisper : wip decoding parameters + strategies (part 2)
* whisper : wip decoding parameters + strategies (part 3)
* whisper : wip decoding parameters + strategies (part 4)
* whisper : fix prompt_past update to not include prompt_init
* whisper : temperature + best_of support
* whisper : support for compression_ration_threshold
We actually use entropy, but it is similar
* command : fix example to use logits instead of obsolete probs
* whisper : handle empty sequence ranking
* whisper : add WHISPER_DEBUG + diagnostic prints + new main args
* whisper : minor fixes
* whisper : add beam-search support
* whisper : bug fix when there no previous context
* whisper : add comments
* stream : disable temperature fallback
For real-time processing, we always want a single decoder running at T=0
* whisper.swiftui : update example - fix paths + add empty folders
2023-01-15 11:29:57 +02:00
1512545149
whisper : add loader class to allow loading from buffer and others ( #353 )
...
* whisper : add loader to allow loading from other than file
* whisper : rename whisper_init to whisper_init_from_file
* whisper : add whisper_init_from_buffer
* android : Delete local.properties
* android : load models directly from assets
* whisper : adding <stddef.h> needed for size_t + code style
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com >
2023-01-08 13:03:33 +02:00
52a3e0c92a
ggml : improve vec_dot_f16 unrolling in flash_attn_f16
2023-01-08 11:41:18 +02:00
d1ea1220ff
command : clean-up / refactoring / formatting ( #383 )
2023-01-07 21:43:24 +02:00
9c4a1522f6
command : always-prompt mode ( #383 )
2023-01-07 21:41:11 +02:00
6b351bb669
command : add "guided-mode" video demo in the README.md
2023-01-06 18:59:26 +02:00
84c6b42e65
cmake : update to 3.19 ( #351 )
...
- update from 3.0 (from 2014) to 3.19 (from 2020)
- move some global setting onto the targets (through a cmake include)
2023-01-05 21:22:48 +02:00
f00509d57c
command : refactor to split command list & general transcription modes ( #331 )
...
This makes it easier to understand if you're looking for only one of the capabilities.
2022-12-31 14:08:57 +02:00
dc90efd504
examples : small code cleanups ( #322 )
...
- remove unnecessary initialization of string to ""
- use empty() instead of checking size()
- use emplace_back instead of push_back
- use nullptr instead of NULL
- remove unnecessary call to .data() on string
- use character overload of find_first_of() instead of passing a string
2022-12-23 20:18:51 +02:00
99da1e5cc8
cmake : enable and fix -Wall -Wextra -Wpedantic C++ warnings
2022-12-19 20:45:08 +02:00
4312995974
command : better indentation
2022-12-16 19:38:18 +02:00
5eeeb3412d
command : update README, show how to use guided mode
2022-12-16 19:38:18 +02:00
6a69e3ae27
command : adding guided mode
2022-12-16 19:38:18 +02:00
3b1aacbe6d
talk : talk with AI in the terminal
2022-12-10 16:51:58 +02:00
164df0d447
whisper.objc : fix context + broken readme links
2022-11-27 10:52:27 +02:00
68ecadbbc9
command.wasm : add voice assistant example for the Web ( #171 )
...
Same as the command-line tool "command", but runs in the browser
Also, added helper script "extra/deploy-wasm.sh" and fixed some timing
constants for the WASM examples.
2022-11-26 11:40:06 +02:00
be16dfa038
whisper.wasm : do not block page while processing ( close #86 )
2022-11-25 23:07:42 +02:00
1246dd023e
command : add demonstration video
2022-11-25 20:23:58 +02:00
0be27bbd92
command : fix build + fix README + add bold printing
2022-11-25 19:53:50 +02:00
bc88eb13c6
examples : add "command" tool ( #171 )
2022-11-25 19:36:57 +02:00