whisper.cpp/examples
Daniel Bevenius e41bc5c61a
vad : add initial Voice Activity Detection (VAD) support (#3065)
* vad : add initial Voice Activity Detection (VAD) support

This commit add support for Voice Activity Detection (VAD). When enabled
this feature will process the audio input and detect speech segments.
This information is then used to reduce the number of samples that need
to be processed by whisper_full.

Resolves: https://github.com/ggml-org/whisper.cpp/issues/3003

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
2025-05-12 16:10:11 +02:00
..
addon.node addon.node : support max_context api for addon.node (#3025) 2025-04-11 06:36:38 +02:00
bench rename : ggerganov -> ggml-org (#3005) 2025-04-04 16:11:52 +03:00
bench.wasm examples : add HEAPU8 to all of the exported runtime methods (#3134) 2025-05-10 06:44:13 +02:00
cli vad : add initial Voice Activity Detection (VAD) support (#3065) 2025-05-12 16:10:11 +02:00
command rename : ggerganov -> ggml-org (#3005) 2025-04-04 16:11:52 +03:00
command.wasm wasm : add note about worker.js file generation [no ci] (#3133) 2025-05-09 15:42:45 +02:00
deprecation-warning examples : add WHISPER_SDL2 check to deprecation executables (#2911) 2025-03-20 18:36:02 +01:00
lsp common : separate whisper sources (#2846) 2025-02-27 12:50:32 +02:00
python readme : remove invalid flag from Python example (#2396) 2024-08-30 14:00:38 +03:00
quantize whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
server whisper: remove MSVC warnings pragmas (#3090) 2025-05-05 13:09:35 +02:00
stream common : separate whisper sources (#2846) 2025-02-27 12:50:32 +02:00
stream.wasm wasm : add note about worker.js file generation [no ci] (#3133) 2025-05-09 15:42:45 +02:00
sycl sycl: fix example build (#2570) 2024-11-18 14:57:23 +02:00
talk-llama whisper: remove MSVC warnings pragmas (#3090) 2025-05-05 13:09:35 +02:00
wchess examples : add HEAPU8 to all of the exported runtime methods (#3134) 2025-05-10 06:44:13 +02:00
whisper.android examples : add new sources 2025-04-03 10:30:16 +03:00
whisper.android.java examples : add new sources 2025-04-03 10:30:16 +03:00
whisper.nvim rename : ggerganov -> ggml-org (#3005) 2025-04-04 16:11:52 +03:00
whisper.objc examples : clarify Core ML encoder model usage [no ci] (#2987) 2025-04-02 08:32:14 +02:00
whisper.swiftui examples : clarify Core ML encoder model usage [no ci] (#2987) 2025-04-02 08:32:14 +02:00
whisper.wasm wasm : add note about worker.js file generation [no ci] (#3133) 2025-05-09 15:42:45 +02:00
CMakeLists.txt examples : add dl to the list of libraries linked (#2875) 2025-03-14 04:42:20 +01:00
coi-serviceworker.js ci : add github pages workflow for wasm examples (#2969) 2025-03-31 11:34:40 +02:00
common-ggml.cpp common : remove old types 2024-12-18 12:52:16 +02:00
common-ggml.h whisper : add integer quantization support (#540) 2023-04-30 18:51:57 +03:00
common-sdl.cpp common : more general m_audio_len update logic (#2855) 2025-03-07 10:10:03 +02:00
common-sdl.h sdl : fix audio callback (#1523) 2023-11-20 13:16:38 +02:00
common-whisper.cpp whisper: remove MSVC warnings pragmas (#3090) 2025-05-05 13:09:35 +02:00
common-whisper.h common : separate whisper sources (#2846) 2025-02-27 12:50:32 +02:00
common.cpp whisper: remove MSVC warnings pragmas (#3090) 2025-05-05 13:09:35 +02:00
common.h examples : update link to Paul Tol's color scheme [no ci] (#3140) 2025-05-12 09:02:06 +02:00
ffmpeg-transcode.cpp examples : fix deprecated FFmpeg functions (#3073) 2025-04-28 06:16:50 +02:00
generate-karaoke.sh examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759) 2025-02-27 09:06:54 +02:00
grammar-parser.cpp whisper : reorganize source code + improve CMake (#2256) 2024-06-26 19:34:09 +03:00
grammar-parser.h whisper : add grammar-based sampling (#1229) 2023-11-13 10:51:34 +02:00
helpers.js js : remove un-needed request header from fetchRemote (#2119) 2024-05-13 15:13:19 +03:00
json.hpp examples : clean up common code (#1871) 2024-02-19 10:50:15 +02:00
livestream.sh rename : ggerganov -> ggml-org (#3005) 2025-04-04 16:11:52 +03:00
miniaudio.h examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759) 2025-02-27 09:06:54 +02:00
server.py examples : update server.py to match github pages app [no ci] (#3004) 2025-04-04 10:23:53 +02:00
stb_vorbis.c examples : use miniaudio for direct decoding flac, mp3, ogg and wav (#2759) 2025-02-27 09:06:54 +02:00
twitch.sh rename : ggerganov -> ggml-org (#3005) 2025-04-04 16:11:52 +03:00
yt-wsp.sh rename : ggerganov -> ggml-org (#3005) 2025-04-04 16:11:52 +03:00