mirror of
https://github.com/ggerganov/whisper.cpp.git
synced 2025-01-22 05:48:55 +01:00
21c1e6afc5
- Slight tweaks to README for improved comprehension.
A sample SwiftUI app that uses [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) for voice-to-text transcription.

See also: [whisper.objc](https://github.com/ggerganov/whisper.cpp/tree/master/examples/whisper.objc).

**Usage**:

1. Select a model from the [whisper.cpp repository](https://github.com/ggerganov/whisper.cpp/tree/master/models).[^1]
2. Add the model to `whisper.swiftui.demo/Resources/models` **via Xcode**.
3. Select a sample audio file (for example, [jfk.wav](https://github.com/ggerganov/whisper.cpp/raw/master/samples/jfk.wav)).
4. Add the sample audio file to `whisper.swiftui.demo/Resources/samples` **via Xcode**.
5. Select the "Release"[^2] build configuration under "Run", then deploy and run on your device.

**Note:** Pay attention to the folder path: `whisper.swiftui.demo/Resources/models` is the directory for resource files such as models, whereas `whisper.swiftui.demo/Models` contains the app's source code.

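
As a rough illustration of why the folder path matters, the sketch below looks up a bundled file by name inside a named resource subdirectory. It assumes that the `models` and `samples` folders end up as subdirectories of the app bundle, and the helper `bundledResourceURL` is hypothetical, not taken from the demo's source. The model name `ggml-base.en` is only an example. `Bundle.url(forResource:withExtension:subdirectory:)` is the standard Foundation lookup.

```swift
import Foundation

// Hypothetical helper (not part of the demo): finds a file that was added to
// a resource folder such as `Resources/models` or `Resources/samples`.
// Assumption: the folder is copied into the app bundle as a subdirectory.
func bundledResourceURL(named name: String,
                        ext: String,
                        subdirectory: String,
                        bundle: Bundle = .main) -> URL? {
    // Returns nil when the file is missing from the bundle.
    return bundle.url(forResource: name, withExtension: ext, subdirectory: subdirectory)
}

// Example lookups; the file names are assumptions for illustration.
let modelURL = bundledResourceURL(named: "ggml-base.en", ext: "bin", subdirectory: "models")
let sampleURL = bundledResourceURL(named: "jfk", ext: "wav", subdirectory: "samples")

if modelURL == nil {
    print("Model not found; check that it was added to Resources/models via Xcode.")
}
if sampleURL == nil {
    print("Sample not found; check that it was added to Resources/samples via Xcode.")
}
```

If a lookup like this returns `nil`, a likely cause is that the file was placed in the wrong folder or was not added to the app target through Xcode.
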
[^1]: I recommend the tiny, base or small models for running on an iOS device.

[^2]: The `Release` build can improve transcription performance. This project also adds `-O3 -DNDEBUG` to `Other C Flags`, but setting flags at the app-project level is not ideal in real-world projects because they apply to all C/C++ files. In your own project, consider splitting the code into separate Xcode projects within a workspace.

![image](https://user-images.githubusercontent.com/1991296/212539216-0aef65e4-f882-480a-8358-0f816838fd52.png)