whisper.cpp

Author	SHA1	Message	Date
Georgi Gerganov	9bbca3110f	ref #9 : add API documentation in whisper.h	2022-10-08 18:09:56 +03:00
Georgi Gerganov	5e563ef635	Fix Makefile for MacBook Intel	2022-10-08 17:35:55 +03:00
Georgi Gerganov	2ca8cc77b2	ref #17 : print whisper logs to stderr Only the transcribed/translted text is printed to stdout. This way, one can redirect the result to a file.	2022-10-08 17:28:06 +03:00
Georgi Gerganov	8c7c018893	ref #17 : add options to output result to file Support for: - plain text - VTT - SRT	2022-10-08 17:22:22 +03:00
Georgi Gerganov	4c4ab71d4d	Update README.md	2022-10-08 11:46:34 +03:00
Georgi Gerganov	b43b36e006	Update tests	2022-10-08 11:43:42 +03:00
Georgi Gerganov	37110d693e	ci : add base model tests to GH Actions	2022-10-08 11:43:42 +03:00
Georgi Gerganov	2d47693435	Update README.md	2022-10-08 11:43:42 +03:00
Georgi Gerganov	a53e06757f	Create README.md	2022-10-08 11:43:42 +03:00
Georgi Gerganov	0e3ba2f9fc	Adding dummy models for testing purposes	2022-10-08 11:43:42 +03:00
Georgi Gerganov	2f069335ab	Adding sanitizer tests	2022-10-08 11:43:42 +03:00
Georgi Gerganov	29b041f79b	Cleanup CMakeLists.txt	2022-10-08 09:02:41 +03:00
Georgi Gerganov	4a732b2879	cmake : fixes	2022-10-08 09:02:41 +03:00
Georgi Gerganov	68f5962be6	ci : add cmake builds	2022-10-08 09:02:41 +03:00
Georgi Gerganov	332c9d77fe	whisper : fix bug in token sampling logic Could overflow buffer	2022-10-08 09:02:41 +03:00
Georgi Gerganov	877c058179	Add CMake support	2022-10-08 09:02:41 +03:00
Georgi Gerganov	481cd685d5	ref #10 : option to keep context in "stream" example Seems the results become worse when we keep the context, so by default this is not enabled	2022-10-07 22:30:44 +03:00
Georgi Gerganov	3f15bb8a08	ref #10 : add "step" argument for "stream" example Controls how often we run the inference. By default, we run it every 3 seconds.	2022-10-07 22:07:24 +03:00
Georgi Gerganov	7787b878e1	ref #16 , #22 : add "offset" argument Allows to start processing the input audio at some offset from the beginning. Useful for splitting a long job into multiple tasks.	2022-10-07 22:00:40 +03:00
Georgi Gerganov	e29a5dacc6	ref #11 , #18 , #26 : fix CACHE_LINE_SIZE constant	2022-10-07 21:56:44 +03:00
Georgi Gerganov	844d60b284	Add CI using Github Actions	2022-10-07 18:34:27 +03:00
Georgi Gerganov	700898e6ed	ref #22 : add option to provide multiple input .wav files	2022-10-05 23:44:10 +03:00
Georgi Gerganov	6b1c3cc198	Update README.md	2022-10-05 23:13:15 +03:00
Georgi Gerganov	b8f713482e	Minor updates	2022-10-05 23:11:02 +03:00
Georgi Gerganov	167324584b	wip : rpi4 support	2022-10-05 23:03:46 +03:00
Georgi Gerganov	ce1fe95902	wip : improve makefile	2022-10-05 23:03:46 +03:00
Georgi Gerganov	74197ffc11	Merge pull request #20 from ArtyomZemlyak/master Fix: main get language from cli args	2022-10-05 07:27:29 +03:00
Артём Земляк	495b81b367	Fix: main get n_threads from cli	2022-10-05 09:47:48 +07:00
Артём Земляк	f007e186fe	Fix: main get language from cli args	2022-10-05 09:24:53 +07:00
Georgi Gerganov	e7a15876f8	Update README.md	2022-10-04 23:27:25 +03:00
Georgi Gerganov	6814cc9b02	Improve result printing	2022-10-04 23:18:15 +03:00
Georgi Gerganov	eba33adadd	Extend C-style API with full inference methods	2022-10-04 23:18:15 +03:00
Georgi Gerganov	6b77124e01	Initial C-style interface for whisper.cpp	2022-10-04 23:18:15 +03:00
Georgi Gerganov	be8ba034f6	ref #10 : handle Ctrl+C in "stream" app	2022-10-02 20:11:17 +03:00
Georgi Gerganov	d71e567656	Update README.md	2022-10-02 18:19:22 +03:00
Georgi Gerganov	b6bf906730	ref #10 : quick-and-dirty attempt for real-time audio transciption - Processes input in chunks of 3 seconds. - Padding audio with silence - Uses 1 second audio from previous pass - No text context	2022-10-02 17:55:45 +03:00
Georgi Gerganov	77d929f603	Fix bug in FFT The FFT routine does not work for odd N Solution is to add DFT and use it when N is odd	2022-10-02 17:46:21 +03:00
Georgi Gerganov	6d654d192a	Fix reading of stereo WAV files	2022-10-01 08:41:57 +03:00
Georgi Gerganov	62897e8ae6	Update README.md	2022-10-01 00:01:04 +03:00
Georgi Gerganov	15b49e8baf	Bug fix Longer prompts could cause out-of-bounds access	2022-09-30 20:37:29 +03:00
Georgi Gerganov	3bcdbdfc32	Reduce memory usage even more + better sampling - The encode/decode memory buffers are now reused - If the 30-sec segment goes for too long without a timestamp token, we force one. Improves transcription for large model - Stereo support - Add "micro-machines.wav" sample	2022-09-30 19:35:27 +03:00
Georgi Gerganov	310f4883d1	Update README.md	2022-09-29 23:48:01 +03:00
Georgi Gerganov	fd3f3d748f	Update README.md	2022-09-29 23:37:59 +03:00
Georgi Gerganov	5877c3578e	ref #4 : added transcription timestamps Can be turned off with "-nt" argument. Performance has also improved.	2022-09-29 23:09:39 +03:00
Georgi Gerganov	8d4041c31f	Merge pull request #3 from cdosoftei/master Pass -pthread to linker	2022-09-28 22:06:09 +03:00
cdosoftei	d4fcfa47b0	Pass -pthread to linker	2022-09-28 15:01:54 -04:00
Georgi Gerganov	4352a6018b	Update README.md	2022-09-28 21:13:32 +03:00
Georgi Gerganov	f888c2373d	Flash + language support (ref #2 ) - Achieved big performance improvement + memory usage reduction - Can now translate / transcribe different languages	2022-09-28 21:07:32 +03:00
Georgi Gerganov	154fa796dd	ref #1 : add -pthread to compilation flags	2022-09-26 11:58:44 +03:00
Georgi Gerganov	476182e439	Update README.md and simplify usage	2022-09-26 09:36:51 +03:00

1 2

53 Commits