whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-06-29 22:19:17 +02:00

Author	SHA1	Message	Date
Daniel Bevenius	c88ffbf9ba	ci : use selective copy for musa image (#3296 ) This commit modified the musa docker file to selectively copy directories needed for the container image. This commit also added a step to the docker workflow to free up disk space in attempt to make enough room for the large musa build containers. The motivation for this change is to reduce the size of the container image and try to avoid disk usage issues in CI.	2025-06-27 15:43:56 +02:00
Daniel Bevenius	7069394447	ci: set fail-fast to false in docker.yml (#3294 ) * ci: set fail-fast to false in docker.yml This commit modifies the GitHub Actions workflow for Docker builds to disable the fail-fast behavior. The motivation for this is that currently if one of the strategy jobs fails any other job that is in progress will be cancelled. There is no need for this as the jobs are independent. * ci : update docker.yml to use a single build This commit updates the docker job to only build the image once instead of twice (only happens when pushing to the master branch). Instead this will tag the image with the commit SHA when pushing to master. The motivation for this change is to reduce the time it takes to run this job and also it might help with the disk space issues we are experiencing for this job when it runs on pushes to master.	2025-06-27 09:55:56 +02:00
KITAITI Makoto	f8abbeb234	ruby : add Whisper::VERSION (#3292 ) * Add a test for segment * Check option existence * Use more proper variable to define build option * Assert Core ML enabled * Define Whisper::VERSION * Add test for Whisper::VERSION * Add signature of Whisper::VERSION	2025-06-27 04:41:26 +02:00
Daniel Bevenius	32cf4e2aba	whisper : add version function (#3289 ) * whisper : add version function This commit adds a version function to the whisper API. The motivation for this is that it might be convenient to have a way to programmatically check the version. Example usage: ```c++ printf("Using whisper version: %s\n", whisper_version()); ``` Will output: ```console Using whisper version: 1.7.6 ``` * examples : add version to android example CMakeLists.txt	2025-06-26 18:09:42 +02:00
Daniel Bevenius	35034c5aea	ci : add should_release variable (#3288 ) * ci : add should_release variable This commit adds a `should_release` variable to the GitHub Actions workflow to determine if a release should be created based on the tag or branch conditions. The motivation for this that it simplifies the logic for deciding whether to upload artifacts or not, making it easier to maintain if we need to change the conditions in the future. * ci : set release draft to true	2025-06-26 16:29:29 +02:00
toboil-features	897b071dc6	docs : add cmake "-j" flag in README.md (#3284 ) Make cmake commands encounter multithreading in README.md file.	2025-06-26 13:23:19 +02:00
Daniel Bevenius	4daf7050ca	ci : add support for tag-based releases (#3287 ) This commit modifies the GitHub Actions workflow to support tag-based releases. When a tag is pushed that starts with 'v', the workflow will use that tag name for the release process. I think this was the once the behavior, but it was lost in updates that I've made to the workflow. This commit restores that functionality.	2025-06-25 21:43:58 +02:00
Georgi Gerganov	a8d002cfd8	release : v1.7.6 v1.7.6	2025-06-25 16:47:03 +03:00
Georgi Gerganov	06bdaa6c0c	bench : update benches	2025-06-25 16:45:19 +03:00
Georgi Gerganov	dc8dda60ee	bench : print system info before ctx check	2025-06-25 16:01:32 +03:00
Daniel Bevenius	1ad258ca31	stream : add nullptr check of whisper_context (#3283 ) * stream : add nullptr check of whisper_context This commit adds a check to ensure that the `whisper_context` is not null after initialization. The motivation for this is that currently, if the initialization fails, the program continues to run leading to a segmentation fault. This sort of check is performed by others examples like whisper-cli. Refs: https://github.com/ggml-org/whisper.cpp/issues/3280#issuecomment-3003778035 * examples : add nullptr check for whisper_context	2025-06-25 14:16:31 +02:00
Daniel Bevenius	7dd2997a01	ci : enable main-cuda build (#3282 ) This commit re-enables the main-cuda Docker build in the CI workflow. The main-cuda Dockerfile has been updated to remove build artifacts and also print the size of the /app directory after the build. A similar change was recently made to the musa Dockerfile, and perhaps this job was also having similar disk space issues. The motivation for this change is that this configuration has been disabled for a while due to persistent build failures. However, the actual logs are now longer available. Resolves: https://github.com/ggml-org/whisper.cpp/issues/3040	2025-06-25 12:12:36 +02:00
Joas Dev	c85b1ae84e	bindings.java : update java example (#3281 ) This commit updates the example in the README.md file as the current Java example code is not working. Resolves: https://github.com/ggml-org/whisper.cpp/issues/2860	2025-06-25 06:35:38 +02:00
glaszig	0083335ba0	coreml : backport CoreML features to macos < 14 (#3255 )	2025-06-24 09:24:27 +02:00
Daniel Bevenius	9c47902308	ci : reduce musa image size (#3277 ) * ci : reduce musa image size This commit contains an attempt to reduce the size of the musa Docker image by copying only the necessary files from the build stage. The motivation for this is that the CI runs sometimes fail with out of memory errors. These seems to be able to pass for PRs, at least sometimes but fail upon push to the master branch. * ci : remove build time files instead of selective copying	2025-06-24 08:20:28 +02:00
Yukimasa Funaoka	a0d2c632e4	whisper : add .gitignore entries for OpenVINO support (#3276 )	2025-06-24 07:50:16 +02:00
Aaron Ang	4d6ae52ed3	command: output commands to text file (#3273 ) This commit implements code for the command line argument `-f --file FNAME` which is currently missing.	2025-06-24 06:41:21 +02:00
Daniel Bevenius	a422176937	ci : add apt-get clean to musa Dockerfile (#3275 ) * ci : add apt-get clean to musa Dockerfile This commit adds `apt-get clean` to the musa Dockerfile to reduce the image size by removing cached package files after installation. The motivation for this is to try to reduce the size of the Docker image and see if this can avoid the "no space left on device" error during the CI build process. Refs: https://github.com/ggml-org/whisper.cpp/actions/runs/15815324254	2025-06-23 12:34:44 +02:00
KITAITI Makoto	cead8f5357	ruby : specify Apple frameworks explicitly on build (#3270 ) * Add Apple frameworks to $LDFLAGS when needed * Add utility method to Options * Remove unnecessary propaty date from gemspec * Add Apple frameworks for CoreML build * Add Accelerate framework only for Apple platform * Fix ZipURI#cache signature * Download test fixtures if needed	2025-06-23 06:34:05 +02:00
Georgi Gerganov	e6c10cf3d5	talk-llama : sync llama.cpp ggml-ci	2025-06-21 07:34:17 +03:00
Georgi Gerganov	d65a579a0a	sync : ggml ggml-ci	2025-06-21 07:34:17 +03:00
Aman Gupta	b68222f92c	CUDA: add conv_2d_transpose (llama/14287) * CUDA: add conv_2d_transpose * remove direct include of cuda_fp16 * Review: add brackets for readability, remove ggml_set_param and add asserts	2025-06-21 07:34:17 +03:00
Nicolò Scipione	a455dcb04c	sycl: add usage of enqueue_functions extension (llama/14244) * Add header and namespace to use enqueue_functions extension * Convert submit and parallel_for to use new extension in convert.cpp * Convert submit and parallel_for to use extension in ggml-sycl.cpp * Convert submit and parallel_for to use extension in gla.cpp * Convert submit and parallel_for in mmq.cpp * Convert submit and parallel_for in mmvq.cpp * Convert submit and parallel_for in remaining files * Convert all simple parallel_for to nd_launch from enqueue_functions extension * Wrapping extension in general function Create a general function that enable the enqueue_functions extension if it is enable in the compiler, otherwise call the general SYCL function to launch kernels. --------- Signed-off-by: nscipione <nicolo.scipione@codeplay.com>	2025-06-21 07:34:17 +03:00
Christian Kastner	af7168174c	Implement GGML_CPU_ALL_VARIANTS for PowerPC (llama/14286) * Add PowerPC feature detection and scoring * ggml-cpu: Implement GGML_CPU_ALL_VARIANTS for PowerPC * ggml-cpu: Delay some initializations until function is called When using GGML_BACKEND_DL=ON, these initializations might use instructions that are not supported by the current CPU. --------- Co-authored-by: Diego Devesa <slarengh@gmail.com>	2025-06-21 07:34:17 +03:00
Diego Devesa	33d1f0a3e0	cuda : synchronize graph capture and cublas handle destruction (llama/14288) Workarounds an issue that may cause CUDA graph capture to fail when a cuBLAS handle is destroyed in a different thread	2025-06-21 07:34:17 +03:00
Georgi Gerganov	018b2d340e	ggml : fix repack work size for mul_mat_id (llama/14292) ggml-ci	2025-06-21 07:34:17 +03:00
Charles Xu	694f435d22	ggml: Update KleidiAI to v1.9.0 (llama/14277)	2025-06-21 07:34:17 +03:00
Aman Gupta	5efd43c956	CUDA: add conv_2d_dw (llama/14265) * CUDA: add conv_2d_dw * better naming * simplify using template * Review: fix operation ordering in ggml-cuda, use __forceinline__, use more const	2025-06-21 07:34:17 +03:00
Diego Devesa	71adde9203	ggml-cpu : remove unnecesary arm feature detection (llama/14281) Support for Arm runtime feature detection has now been added to GGML_CPU_ALL_VARIANTS. This removes the old and not very functional code.	2025-06-21 07:34:17 +03:00
fanyang	cef59c1e26	build : suppress gcc15 compile warnings (llama/14261) * Change _contains_any() substrs to std::string_view and fix the find comparison logic.	2025-06-21 07:34:17 +03:00
Anton Mitkov	a02a2d4240	sycl: Cleanup codepaths in Get Rows in sycl backend (llama/14215) Addresses unused reorder path	2025-06-21 07:34:17 +03:00
Aaron Teo	be4ea0826b	llamafile : support s390x SIMD instruction set (llama/14273)	2025-06-21 07:34:17 +03:00
0cc4m	1aca7b5c8a	Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (llama/14249)	2025-06-21 07:34:17 +03:00
Georgi Gerganov	b251d739ad	metal : add mean kernel (llama/14267) * metal : add mean kernel ggml-ci * cont : dedup implementation ggml-ci	2025-06-21 07:34:17 +03:00
Aaron Teo	203451bcba	ggml-cpu: reduce asm calls for hsum (llama/14037) Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-06-21 07:34:17 +03:00
Aaron Teo	34940abe53	ggml-cpu: fix uncaught underscore terminators (llama/14023) Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>	2025-06-21 07:34:17 +03:00
Charles Xu	4fc9c34126	ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (llama/14258)	2025-06-21 07:34:17 +03:00
Acly	471df139fa	Add `ggml_roll` (ggml/1274) * ggml : add ggml_roll * use set/get_op_params & std::min	2025-06-21 07:34:17 +03:00
Daniel Bevenius	3e65f518dd	android : update CMakeLists.txt to use FetchContent for ggml (#3268 ) * android : update CMakeLists.txt to use FetchContent for ggml This commit updates the CMakeLists.txt file for the Android Whisper example to use FetchContent for managing the ggml library. The motivation for this change is avoid having to make manual changes to the CMakeLists.txt file after syncing the ggml library. I've built and run the example locally to verify that it works as expected. Refs: https://github.com/ggml-org/whisper.cpp/pull/3265#issuecomment-2986715717 * android.java : update cmake to use FetchContent for ggml This commit updates the CMake configuration for the Android Java example to use `FetchContent` for including the `ggml` library. Do be able to use FetchContent we also update the `compileSdkVersion` and `targetSdkVersion` to 31, and the `buildToolsVersion` to '30.0.3'. This also required a an update to the Gradle plugin version to 7.4.0. The motivation for this change is avoid having to make manual changes to the CMakeLists.txt file after syncing the ggml library.	2025-06-19 16:06:42 +02:00
Georgi Gerganov	17bece1885	cmake : fix android build (#3265 ) * cmake : fix android build --------- Co-authored-by: Daniel Bevenius <daniel.bevenius@gmail.com>	2025-06-19 08:24:41 +02:00
Daniel Bevenius	ecb8f3c2b4	examples : add stereo to mono conversion in read_audio_data (#3266 ) This commit adds a conversion from stereo to mono in the `read_audio_data` function of `common-whisper.cpp`. The motivation for this change is prior to Commit `7d3da68f79` ("examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759)", there was a step that read stereo int16 data -> pcm16 (448512 samples), and then converted to mono (224256 samples), and then also convert to stereo in `pcmf32s. The middle step here seems to have been missed when rewriting the code to use Miniaudio and caused issues then transcribing stereo audio files. For example, currently using the audio sample in the linked issue the output is: ```console [00:00:00.000 --> 00:00:03.000] (speaker 1) Sous-titres réalisés para la communauté d'Amara.org ``` And with the change in this commit the output is: ``` [00:00:00.000 --> 00:00:01.500] (speaker 1) sonnerie de téléphone [00:00:01.500 --> 00:00:07.000] (speaker 1) Salut jeune homme ! [00:00:07.000 --> 00:00:08.500] (speaker 0) C'est vrai que je te dérange ? [00:00:08.500 --> 00:00:10.500] (speaker 1) Ah pas du tout, pas du tout, pas du tout ! [00:00:10.500 --> 00:00:12.500] (speaker 1) J'étais en train de... [00:00:12.500 --> 00:00:14.500] (speaker 1) de préparer un courrier ``` Resolves: https://github.com/ggml-org/whisper.cpp/issues/3092	2025-06-18 17:41:43 +02:00
Georgi Gerganov	2f60ebc3c2	talk-llama : sync llama.cpp ggml-ci	2025-06-18 12:40:34 +03:00
Georgi Gerganov	69061e356f	sync : ggml ggml-ci	2025-06-18 12:40:34 +03:00
bandoti	0e068779c7	cmake: remove shader-gen step-targets from ggml-vulkan (llama/14226) * Remove step-targets from vulkan-shaders-gen * Unset DESTDIR when building vulkan-shaders-gen	2025-06-18 12:40:34 +03:00
xctan	ac8a303c9a	ggml-cpu : remove the weak alias trick (llama/14221)	2025-06-18 12:40:34 +03:00
R0CKSTAR	2a84593960	musa: fix build warning (unused variable) (llama/14231) Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>	2025-06-18 12:40:34 +03:00
Diego Devesa	44871c8a3e	llama : add thread safety test (llama/14035) * llama : add thread safety test * llamafile : remove global state * llama : better LLAMA_SPLIT_MODE_NONE logic when main_gpu < 0 GPU devices are not used --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>	2025-06-18 12:40:34 +03:00
bandoti	ad6cd94a3a	cmake: clean up external project logic for vulkan-shaders-gen (llama/14179) * Remove install step for vulkan-shaders-gen * Add install step to normalize msvc with make * Regenerate modified shaders at build-time	2025-06-18 12:40:34 +03:00
uvos	dbad9d8fba	HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202)	2025-06-18 12:40:34 +03:00
Charles Xu	518835ee56	ggml: Add Android support for GGML_CPU_ALL_VARIANTS (llama/14206)	2025-06-18 12:40:34 +03:00

1 2 3 4 5 ...

2817 Commits