From 13d92d08ae26031545921243256aaaf0ee057943 Mon Sep 17 00:00:00 2001 From: KITAITI Makoto Date: Fri, 23 May 2025 17:38:26 +0900 Subject: [PATCH] docs : fix VAD section heading levels (#3186) --- README.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 8b010a72..44ebc41e 100644 --- a/README.md +++ b/README.md @@ -733,7 +733,7 @@ let package = Package( ) ``` -### Voice Activity Detection (VAD) +## Voice Activity Detection (VAD) Support for Voice Activity Detection (VAD) can be enabled using the `--vad` argument to `whisper-cli`. In addition to this option a VAD model is also required. @@ -747,7 +747,7 @@ transcription process. The following VAD models are currently supported: -#### Silero-VAD +### Silero-VAD [Silero-vad](https://github.com/snakers4/silero-vad) is a lightweight VAD model written in Python that is fast and accurate. @@ -792,7 +792,7 @@ $ ./build/bin/whisper-cli \ --vad-model ./models/silero-v5.1.2-ggml.bin ``` -#### VAD Options +### VAD Options * --vad-threshold: Threshold probability for speech detection. A probability for a speech segment/frame above this threshold will be considered as speech.