Update README.md (ref #50)

This commit is contained in:
Georgi Gerganov 2022-10-15 09:40:08 +03:00 committed by GitHub
parent b2f1600aa3
commit 36945162fa
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -19,7 +19,7 @@ High-performance inference of [OpenAI's Whisper](https://github.com/openai/whisp
To build the main program, run `make`. You can then transcribe a `.wav` file like this:
```bash
$ ./main -f input.wav
./main -f input.wav
```
Before running the program, make sure to download one of the ggml Whisper models. For example:
@ -216,11 +216,23 @@ The `stream` tool samples the audio every half a second and runs the transcripti
More info is available in [issue #10](https://github.com/ggerganov/whisper.cpp/issues/10).
```java
$ ./stream -m ./models/ggml-base.en.bin -t 8 --step 500 --length 5000
./stream -m ./models/ggml-base.en.bin -t 8 --step 500 --length 5000
```
https://user-images.githubusercontent.com/1991296/194935793-76afede7-cfa8-48d8-a80f-28ba83be7d09.mp4
The `stream` tool depends on SDL2 library to capture audio from the microphone. You can build it like this:
```bash
# Install SDL2 on Linux
sudo apt-get install libsdl2-dev
# Install SDL2 on Mac OS
brew install sdl2
make stream
```
## Implementation details
- The core tensor operations are implemented in C ([ggml.h](ggml.h) / [ggml.c](ggml.c))