whisper.cpp/examples at e41bc5c61ae66af6be2bd7011769bb821a83e8ae - whisper.cpp - Gitea: Git with a cup of tea

extern/whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-07-28 21:42:58 +02:00

Files

History

Daniel Bevenius e41bc5c61a vad : add initial Voice Activity Detection (VAD) support (#3065 )

* vad : add initial Voice Activity Detection (VAD) support

This commit add support for Voice Activity Detection (VAD). When enabled
this feature will process the audio input and detect speech segments.
This information is then used to reduce the number of samples that need
to be processed by whisper_full.

Resolves: https://github.com/ggml-org/whisper.cpp/issues/3003

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

2025-05-12 16:10:11 +02:00

..

addon.node : support max_context api for addon.node (#3025 )

2025-04-11 06:36:38 +02:00

rename : ggerganov -> ggml-org (#3005 )

2025-04-04 16:11:52 +03:00

examples : add HEAPU8 to all of the exported runtime methods (#3134 )

2025-05-10 06:44:13 +02:00

vad : add initial Voice Activity Detection (VAD) support (#3065 )

2025-05-12 16:10:11 +02:00

rename : ggerganov -> ggml-org (#3005 )

2025-04-04 16:11:52 +03:00

wasm : add note about worker.js file generation [no ci] (#3133 )

2025-05-09 15:42:45 +02:00

deprecation-warning

examples : add WHISPER_SDL2 check to deprecation executables (#2911 )

2025-03-20 18:36:02 +01:00

common : separate whisper sources (#2846 )

2025-02-27 12:50:32 +02:00

readme : remove invalid flag from Python example (#2396 )

2024-08-30 14:00:38 +03:00

whisper : reorganize source code + improve CMake (#2256 )

2024-06-26 19:34:09 +03:00

whisper: remove MSVC warnings pragmas (#3090 )

2025-05-05 13:09:35 +02:00

common : separate whisper sources (#2846 )

2025-02-27 12:50:32 +02:00

wasm : add note about worker.js file generation [no ci] (#3133 )

2025-05-09 15:42:45 +02:00

sycl: fix example build (#2570 )

2024-11-18 14:57:23 +02:00

whisper: remove MSVC warnings pragmas (#3090 )

2025-05-05 13:09:35 +02:00

examples : add HEAPU8 to all of the exported runtime methods (#3134 )

2025-05-10 06:44:13 +02:00

whisper.android

examples : add new sources

2025-04-03 10:30:16 +03:00

whisper.android.java

examples : add new sources

2025-04-03 10:30:16 +03:00

rename : ggerganov -> ggml-org (#3005 )

2025-04-04 16:11:52 +03:00

examples : clarify Core ML encoder model usage [no ci] (#2987 )

2025-04-02 08:32:14 +02:00

whisper.swiftui

examples : clarify Core ML encoder model usage [no ci] (#2987 )

2025-04-02 08:32:14 +02:00

wasm : add note about worker.js file generation [no ci] (#3133 )

2025-05-09 15:42:45 +02:00

CMakeLists.txt

examples : add dl to the list of libraries linked (#2875 )

2025-03-14 04:42:20 +01:00

coi-serviceworker.js

ci : add github pages workflow for wasm examples (#2969 )

2025-03-31 11:34:40 +02:00

common-ggml.cpp

common : remove old types

2024-12-18 12:52:16 +02:00

common-ggml.h

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

common-sdl.cpp

common : more general m_audio_len update logic (#2855 )

2025-03-07 10:10:03 +02:00

common-sdl.h

sdl : fix audio callback (#1523 )

2023-11-20 13:16:38 +02:00

common-whisper.cpp

whisper: remove MSVC warnings pragmas (#3090 )

2025-05-05 13:09:35 +02:00

common-whisper.h

common : separate whisper sources (#2846 )

2025-02-27 12:50:32 +02:00

common.cpp

whisper: remove MSVC warnings pragmas (#3090 )

2025-05-05 13:09:35 +02:00

common.h

examples : update link to Paul Tol's color scheme [no ci] (#3140 )

2025-05-12 09:02:06 +02:00

ffmpeg-transcode.cpp

examples : fix deprecated FFmpeg functions (#3073 )

2025-04-28 06:16:50 +02:00

generate-karaoke.sh

examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759 )

2025-02-27 09:06:54 +02:00

grammar-parser.cpp

whisper : reorganize source code + improve CMake (#2256 )

2024-06-26 19:34:09 +03:00

grammar-parser.h

whisper : add grammar-based sampling (#1229 )

2023-11-13 10:51:34 +02:00

helpers.js

js : remove un-needed request header from fetchRemote (#2119 )

2024-05-13 15:13:19 +03:00

json.hpp

examples : clean up common code (#1871 )

2024-02-19 10:50:15 +02:00

livestream.sh

rename : ggerganov -> ggml-org (#3005 )

2025-04-04 16:11:52 +03:00

miniaudio.h

examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759 )

2025-02-27 09:06:54 +02:00

server.py

examples : update server.py to match github pages app [no ci] (#3004 )

2025-04-04 10:23:53 +02:00

stb_vorbis.c

examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759 )

2025-02-27 09:06:54 +02:00

twitch.sh

rename : ggerganov -> ggml-org (#3005 )

2025-04-04 16:11:52 +03:00

yt-wsp.sh

rename : ggerganov -> ggml-org (#3005 )

2025-04-04 16:11:52 +03:00