* vad : add initial Voice Activity Detection (VAD) support
This commit adds support for Voice Activity Detection (VAD). When enabled,
this feature processes the audio input and detects speech segments.
This information is then used to reduce the number of samples that need
to be processed by whisper_full.
Resolves: https://github.com/ggml-org/whisper.cpp/issues/3003
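For reference, a minimal usage sketch, assuming the CLI flags and Silero model filename introduced alongside this feature (treat `--vad`, `--vad-model` and the model path as assumptions rather than confirmed names):
```console
# Assumed flags: enable VAD and point whisper-cli at a VAD model
$ ./build/bin/whisper-cli -m models/ggml-base.en.bin -f samples/jfk.wav \
    --vad --vad-model models/ggml-silero-v5.1.2.bin
```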
---------
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
* Update PATH for main/main-cuda container
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Add Dockerfile for musa, .dockerignore and update CI
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Add Moore Threads GPU Support in README.md and replace ./main with whisper-cli
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Forward GGML_CUDA/GGML_MUSA to cmake in Makefile
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Minor updates for PATH ENV in Dockerfiles
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
* Address comments
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
---------
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
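As an illustration of the Makefile forwarding, a sketch assuming the variables are passed straight through to the cmake configuration step:
```console
# Assumed: GGML_MUSA/GGML_CUDA are forwarded to cmake by the Makefile wrapper
$ GGML_MUSA=1 make -j
$ GGML_CUDA=1 make -j
```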
FFmpeg integration was introduced in 1b51fdf by William Tambellini,
but not mentioned in the main documentation.
Add a short guide on how to enable the feature. Confirmed to work
on both Ubuntu 24.04 and Fedora 39.
Signed-off-by: Fujimoto Seiji <fujimoto@ceptord.net>
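A sketch of what enabling the feature looks like, assuming the `WHISPER_FFMPEG` cmake option used by that integration:
```console
# Assumed cmake option for FFmpeg support (Linux only)
$ cmake -B build -D WHISPER_FFMPEG=yes
$ cmake --build build --config Release
```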
This adds a section to the README.md file that describes how to use the
XCFramework.
The motivation for this is that it is not obvious how to use the
XCFramework, and an example will help.
One thing to note is that the example uses the latest release, including
its checksum. We are thinking about how we might automate this in the
future, but for now this is a good start.
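On the checksum point, one way to obtain it for a downloaded release archive is `swift package compute-checksum`; the archive name below is hypothetical:
```console
# Hypothetical archive name; prints the checksum to reference from Package.swift
$ swift package compute-checksum whisper-xcframework.zip
```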
This commit updates the recommended version of Python to 3.11 for Core
ML conversion support. It also adds the `-e` flag to the
`generate-coreml-model.sh` script to ensure that the script exits on the
first error.
The motivation for this is that when following the installation
instructions using Python 3.10, I get the following error:
```console
(venv) $ ./models/generate-coreml-model.sh base.en
A module that was compiled using NumPy 1.x cannot be run in
NumPy 2.1.3 as it may crash. To support both 1.x and 2.x
versions of NumPy, modules must be compiled with NumPy 2.0.
Some module may need to rebuild instead e.g. with 'pybind11>=2.12'.
If you are a user of the module, the easiest solution will be to
downgrade to 'numpy<2' or try to upgrade the affected module.
We expect that some modules will need time to support NumPy 2.
Traceback (most recent call last):
File "/whisper-work/models/convert-whisper-to-coreml.py", line 2, in <module>
import torch
File "/whisper-work/venv/lib/python3.10/site-packages/torch/__init__.py", line 870, in <module>
from . import _masked
File "/whisper-work/venv/lib/python3.10/site-packages/torch/_masked/__init__.py", line 420, in <module>
def sum(input: Tensor,
File "/whisper-work/venv/lib/python3.10/site-packages/torch/_masked/__init__.py", line 223, in _apply_docstring_templates
example_input = torch.tensor([[-3, -2, -1], [0, 1, 2]])
/whisper-work/venv/lib/python3.10/site-packages/torch/_masked/__init__.py:223: UserWarning: Failed to initialize NumPy: _ARRAY_API not found (Triggered internally at /Users/distiller/project/pytorch/torch/csrc/utils/tensor_numpy.cpp:68.)
example_input = torch.tensor([[-3, -2, -1], [0, 1, 2]])
Minimum required torch version for importing coremltools.optimize.torch is 2.1.0. Got torch version 1.11.0.
Traceback (most recent call last):
File "/whisper-work/models/convert-whisper-to-coreml.py", line 4, in <module>
import coremltools as ct
File "/whisper-work/venv/lib/python3.10/site-packages/coremltools/__init__.py", line 120, in <module>
from . import converters, models, optimize, proto
File "/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/__init__.py", line 7, in <module>
from . import libsvm, sklearn, xgboost
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/xgboost/__init__.py", line 6, in <module>
from ._tree import convert
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/xgboost/_tree.py", line 9, in <module>
from ._tree_ensemble import convert_tree_ensemble as _convert_tree_ensemble
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/xgboost/_tree_ensemble.py", line 11, in <module>
from ...models.tree_ensemble import TreeEnsembleClassifier
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/models/__init__.py", line 6, in <module>
from . import (
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/models/ml_program/__init__.py", line 6, in <module>
from . import compression_utils
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/models/ml_program/compression_utils.py", line 8, in <module>
from coremltools.converters.mil.mil import Operation as _Operation
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/mil/__init__.py", line 7, in <module>
from .frontend.tensorflow.tf_op_registry import register_tf_op
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/mil/frontend/__init__.py", line 6, in <module>
from . import tensorflow, tensorflow2, torch
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/mil/frontend/torch/__init__.py", line 11, in <module>
from . import ops, quantization_ops
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/mil/frontend/torch/ops.py", line 36, in <module>
from .internal_graph import InternalTorchIRGraph, InternalTorchIRNode
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/mil/frontend/torch/internal_graph.py", line 15, in <module>
from .exir_utils import extract_io_from_exir_program
File "/Users/danbev/work/ai/whisper-work/venv/lib/python3.10/site-packages/coremltools/converters/mil/frontend/torch/exir_utils.py", line 99, in <module>
) -> Dict[str, torch.fx.Node]:
AttributeError: module 'torch' has no attribute 'fx'
```
Using Python 3.11, the conversion script runs without any errors.
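For reference, the setup that works for me is a Python 3.11 virtual environment; the pip package set below is assumed to match the Core ML instructions in the README:
```console
$ python3.11 -m venv venv
$ source venv/bin/activate
(venv) $ pip install ane_transformers openai-whisper coremltools
(venv) $ ./models/generate-coreml-model.sh base.en
```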
I found the docker instructions in the README.md to be useful, as well as the description of the differences between the docker variants, such as ffmpeg and cuda support. However, this section was removed in v1.7.4, and I would vote to bring it back.
This is a pull request to add that section back.
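As a reminder of what that section covered, a usage sketch (the image name, tag, and invocation below are assumptions, not a statement of the published images):
```console
# Assumed image name/tag; mounts a local models directory and transcribes a sample
$ docker run -it --rm -v $(pwd)/models:/models \
    ghcr.io/ggml-org/whisper.cpp:main "whisper-cli -m /models/ggml-base.en.bin -f ./samples/jfk.wav"
```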
The script itself has a hashbang indicating that it is a shell script,
but the README indicates that it must be executed with `bash`.
I checked the script itself, and it seems to be valid POSIX shell. I can
confirm that it works with busybox sh.
Clarify the reference in the README, so it is clear that bash is not
actually a dependency of this script.
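For example (the script name here is an assumption; the point is that any POSIX-compliant sh works):
```console
# Assumed script; runs under busybox sh, dash, or any other POSIX shell
$ sh ./models/download-ggml-model.sh base.en
```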
* Update README.md
Fix broken C-style API link
* Update whisper_processor.py
Update examples/python/whisper_processor.py to remove the nonexistent flag "-np" from the subprocess.Popen call.
* Add pywhispercpp to the Pybind11 Python wrapper list
abdeladim-s/pywhispercpp wasn't added to the list, or was removed at some point (?).
It was referenced in issue #9, so I feel it is worth adding, as it is one of the first, if not the first, Python wrappers for whisper.cpp.