whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-08-04 14:52:30 +02:00

Author	SHA1	Message	Date
Georgi Gerganov	3172006a24	ggml : fix some compile warnings	2023-11-12 16:36:20 +02:00
Georgi Gerganov	684bc8bd70	readme : update GPU / CUDA	2023-11-12 15:40:37 +02:00
Georgi Gerganov	b0502836b8	whisper : add full CUDA and Metal offloading (#1472) * whisper : migrate to ggml-backend * whisper : fix logit reading * whisper : fix tensor allocation during load * whisper : fix beam-search with CUDA * whisper : free backends + fix compile warning * whisper : print when CUDA is enabled * whisper : fix CoreML * make : clean-up * talk : fix compile warning * whisper : support ggml_conv with CUDA and Metal (#1473) * ggml : add CUDA support for ggml_conv * whisper : remove ggml_repeat for conv bias + single backend * cuda : fix im2col kernel * metal : add im2col support + mul mat-vec f16 x f16 * bench-all : add q4 models * whisper : clean-up * quantize-all : fix * ggml : im2col opts * whisper : avoid whisper_model_data wrapper * whisper : add note that ggml_mul_mat_pad does not work with CUDA * whisper : factor out graph compute in common function * whisper : fixes * whisper : fix UB with measure buffers * whisper : try to fix the parallel whisper_state functionality (#1479) * whisper : try to fix the parallel whisper_state functionality * whisper : fix multi-state Metal * whisper : free backend instances in whisper_state	2023-11-12 15:31:08 +02:00
Ben Nortier	ec7a6f04f9	whisper : return with error from whisper_encode_internal and whisper_decode_internal when abort callback is true (#1456) Co-authored-by: Ben Nortier <ben@bjnortier.com>	2023-11-10 13:51:16 +02:00
Jakub Ráček	37947203e6	talk-llama : add language auto detect (#1467) * Add '-l auto' to talk-llama example * Update examples/talk-llama/talk-llama.cpp --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-11-09 19:21:44 +02:00
bobqianic	953419c69a	openvino : update convert-whisper-to-openvino.py to support v3 (#1459)	2023-11-09 12:42:39 +02:00
Xiao-Yong Jin	0de8582f65	coreml : use the correct `n_mel` value (#1458)	2023-11-08 20:01:41 +00:00
Ben Nortier	baeb733691	whisper : reset mel time when resetting timings (#1452) Co-authored-by: Ben Nortier <ben@bjnortier.com>	2023-11-08 15:52:23 +02:00
Sindre Sorhus	d03c60dd7f	ios : add support for Swift Package Manager (#1370) * Add support for Swift * Make it build in Xcode * Use the SPM package in the SwiftUI example app	2023-11-07 23:53:31 +02:00
Georgi Gerganov	6a5d195109	release : v1.4.3	2023-11-07 16:15:48 +02:00
Georgi Gerganov	0cbef75422	ggml : fix MIN / MAX macro re-definition	2023-11-07 16:08:46 +02:00
Georgi Gerganov	2cdfc4e025	whisper : add support for large v3 (#1444) * whisper : add support for large v3 * bench : fix build + fix go bindings * bench : fix n_mels * models : update readme	2023-11-07 15:30:18 +02:00
Tobrun	973111088b	android : decouple example into a library and app module (#1445)	2023-11-07 14:27:33 +02:00
Ben Nortier	11b503055e	whisper : reset ctx->t_start_us when calling whisper_reset_timings() (#1434) Co-authored-by: Ben Nortier <ben@bjnortier.com>	2023-11-07 11:04:32 +02:00
Georgi Gerganov	b629d2d4fe	cmake : fix talk-llama build	2023-11-07 11:03:21 +02:00
Georgi Gerganov	3bd7d48f51	metal : fix asserts for setThreadgroupMemoryLength (close #1435)	2023-11-07 11:02:16 +02:00
iamthad	435a6b74e3	ci : fix variable names in GitHub actions config (#1440) * Remove _SUPPORT from variables * Change blasdir to OPENBLAS_PATH * Update OpenBLAS URLs	2023-11-07 10:53:24 +02:00
Jhen-Jie Hong	75dc800d21	talk-llama : fix n_gpu_layers usage again (#1442)	2023-11-07 10:51:27 +02:00
Georgi Gerganov	0c91aef2d8	whisper : add missing about callback initializers	2023-11-07 10:49:51 +02:00
Jhen-Jie Hong	3989b29a9b	examples : fix n_gpu_layers usage in talk-llama (#1441)	2023-11-07 01:36:23 +00:00
Jhen-Jie Hong	0463028bc2	whisper : add context param to disable gpu (#1293) * whisper : check state->ctx_metal not null * whisper : add whisper_context_params { use_gpu } * whisper : new API with params & deprecate old API * examples : use no-gpu param && whisper_init_from_file_with_params * whisper.objc : enable metal & disable on simulator * whisper.swiftui, metal : enable metal & support load default.metallib * whisper.android : use new API * bindings : use new API * addon.node : fix build & test * bindings : updata java binding * bindings : add missing whisper_context_default_params_by_ref WHISPER_API for java * metal : use SWIFTPM_MODULE_BUNDLE for GGML_SWIFT and reuse library load * metal : move bundle var into block * metal : use SWIFT_PACKAGE instead of GGML_SWIFT * style : minor updates --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2023-11-06 11:04:24 +02:00
Georgi Gerganov	39cfad0dee	whisper : add support for new distilled Whisper models (#1424) * whisper : add support for new distilled Whisper models * whisper : print log when using distilled models	2023-11-05 19:43:45 +02:00
Georgi Gerganov	6d4d0b5b4b	cuda : fix HIPBLAS build	2023-11-05 19:41:15 +02:00
Georgi Gerganov	f96e1c5b78	sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) * sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) * metal : allow env metal variable to override resource path (#1415) * Allow env variable to override resource path * Update ggml-metal.m --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * sync : restore common / main from `master` * sync : restore whisper from `master` * talk-llama : update to latest llama.cpp * ruby : fix build * ggml : fix 32-bit ARM build * ggml : fix MIN / MAX macro collisions + update ios bindings * ggml : fix ifdefs and MIN / MAX again * exampels : fix Obj-C and Swift examples * ggml : fix 32-bit ARM compatibility * ggml : one more attempt to fix 32-bit ARM compat * whisper : fix support for larger graphs --------- Co-authored-by: Chris Raethke <codesoda@users.noreply.github.com>	2023-11-03 21:35:05 +02:00
bobqianic	8a2bee6717	models : use absolute paths for the converted model (#1356)	2023-11-03 10:44:27 +02:00
Asad Memon	d445098c8f	talk-llama : move up-to-date demo to top (#1417)	2023-11-02 18:50:13 +02:00
Georgi Gerganov	74de25158e	talk-llama : add an up-to-date demo video	2023-11-02 15:28:48 +02:00
Aarni Koskela	bce49a260e	examples : Implement JSON output for Token-Level data in main (#1358)	2023-10-31 19:54:52 +00:00
WhiteOlivierus	45c87b5481	models : Faster download for models on windows using BitTransfer (#1404)	2023-10-30 19:18:12 +00:00
ai-at-home	dfe4bc6e59	README : Update README in stream to clarify where to compile from (Issue #1400) * Clarify doc about where to compile from * Update examples/stream/README.md * Update examples/stream/README.md * Update README.md --------- Co-authored-by: AI @ Home <> Co-authored-by: bobqianic <129547291+bobqianic@users.noreply.github.com>	2023-10-29 17:11:13 +00:00
Johan	54c978c3a3	binding : Expose the audio_ctx param through the Go binding (#1368) * expose the audio_ctx param through the go binding * expose the audio_ctx param to the go binding context	2023-10-15 13:35:06 +01:00
jorismertz	9a7074d4aa	README : fix typo (#1362)	2023-10-13 16:53:23 +01:00
joecryptotoo	a0040f5d12	docker : Add dockerfile for cublas (#1286) * Create Dockerfile * Rename Dockerfile to cublas.Dockerfile * Rename cublas.Dockerfile to .devops/cublas.Dockerfile --------- Co-authored-by: bobqianic <129547291+bobqianic@users.noreply.github.com>	2023-10-11 11:00:17 +01:00
mkiol	940cdb1396	whisper : abort callback improvements (#1345) * whisper : initialize abort_callback to null * whisper : add example how to use abort_callback	2023-10-08 17:22:24 +03:00
Marcin Mielniczuk	1b775cdd68	cmake : Abort the build if a requested feature could not be configured (#1350)	2023-10-07 20:01:18 +01:00
Marcin Mielniczuk	80bf931668	cmake : Prefer pkg-config while looking for BLAS (#1349)	2023-10-07 15:02:07 +01:00
Xiang (Kevin) Li	91c0b23384	models : add conversion scripts from HuggingFace models to CoreML (#1304)	2023-10-04 12:00:25 +03:00
mkiol	2f668c330e	whisper : add abort callback (#1335)	2023-10-04 11:57:55 +03:00
bobqianic	08fa34882f	examples : move wav_writer from stream.cpp to common.h (#1317) * Allocate class on the stack instead of on the heap * Add class wav_writer * fix some minor issues * fix some minor issues * remove potential misleading API	2023-10-03 22:56:11 +03:00
Didzis Gosko	4037705531	whisper : add missing speaker turn API function for whisper_state (#1330)	2023-10-03 22:55:48 +03:00
brunofaustino	c76c11e59c	examples: Update the README for Talk - fixing the gpt2 URL (#1334)	2023-10-01 04:21:32 +08:00
Neil Chudleigh	9edbd0a204	extra: Add benchmark script implemented in Python (#1298) * Create bench.py * Various benchmark results * Update benchmark script with hardware name, and file checks * Remove old benchmark results * Add git shorthash * Round to 2 digits on calculated floats * Fix the header reference when sorting results * FIx order of models * Parse file name * Simplify filecheck * Improve print run print statement * Use simplified model name * Update benchmark_results.csv * Process single or lists of processors and threads * Ignore benchmark results, dont check in * Move bench.py to extra folder * Readme section on how to use * Move command to correct location * Use separate list for models that exist * Handle subprocess error in git short hash check * Fix filtered models list initialization	2023-09-25 23:45:15 +08:00
litong	707507ff6d	Examples: Add save audio to file option in stream.cpp (#1310) * save the recorded audio to a file * Alignment -help * Save the correct audio * chage to a consistent coding style * Correct typo * Update examples/stream/stream.cpp * Update examples/stream/stream.cpp * Correct variable misuse * Update examples/stream/stream.cpp * Update examples/stream/stream.cpp * Update examples/stream/stream.cpp * Update examples/stream/stream.cpp --------- Co-authored-by: bobqianic <129547291+bobqianic@users.noreply.github.com>	2023-09-22 23:43:21 +08:00
JJ	7e1592d2cd	readme: Fix spelling error (#1290) Fixed branding error: Javascript to JavaScript	2023-09-21 15:55:33 +08:00
Artyom Mezin	903c9579b8	examples: Update README.md of main.cpp (#1306)	2023-09-18 22:14:36 +08:00
Jhen-Jie Hong	b440ef8c96	binding : fix ruby build by adding missing ggml-alloc (#1305)	2023-09-18 21:15:45 +08:00
Evgeny Kuznetsov	700f63a806	bench: fix missing include <cstring> (#1303)	2023-09-18 15:51:10 +08:00
Georgi Gerganov	951a119926	whisper : increase tokenizer buffer (close #1259)	2023-09-15 21:11:43 +03:00
Georgi Gerganov	1ca4041b86	talk-llama : update to latest llama.cpp	2023-09-15 20:06:31 +03:00
Georgi Gerganov	80c1512fd5	sync : ggml (const correctness)	2023-09-15 14:49:56 +03:00

766 Commits