extern/whisper.cpp

mirror of https://github.com/ggerganov/whisper.cpp.git synced 2025-08-11 14:24:30 +02:00

History

Mohammadreza Hendiani 04e48094e4 readme : add Fedora dependencies (#1970 )

* README.md

fix documentaion and added fedora liunx dependencies for stream build

* fix documentaion and added fedora liunx dependencies for command build

* fix documentaion and added fedora liunx dependencies for talk build

* fix documentaion and added fedora liunx dependencies for talk-llama build

* reverted back mistakenly removed MacOS documentaion

2024-03-20 18:42:11 +02:00

.gitignore

talk, talk-llama : pass text_to_speak as a file (#1865 )

2024-02-24 09:24:47 +02:00

CMakeLists.txt

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

eleven-labs.py

talk, talk-llama : pass text_to_speak as a file (#1865 )

2024-02-24 09:24:47 +02:00

gpt-2.cpp

sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677 )

2023-12-22 17:53:39 +02:00

gpt-2.h

whisper : add integer quantization support (#540 )

2023-04-30 18:51:57 +03:00

README.md

readme : add Fedora dependencies (#1970 )

2024-03-20 18:42:11 +02:00

speak

talk, talk-llama : pass text_to_speak as a file (#1865 )

2024-02-24 09:24:47 +02:00

speak.bat

speak scripts for Windows

2023-06-01 22:45:00 +10:00

speak.ps1

talk, talk-llama : pass text_to_speak as a file (#1865 )

2024-02-24 09:24:47 +02:00

talk.cpp

talk, talk-llama : pass text_to_speak as a file (#1865 )

2024-02-24 09:24:47 +02:00

README.md

talk

Talk with an Artificial Intelligence in your terminal

Demo Talk

Web version: examples/talk.wasm

Building

The talk tool depends on the SDL2 library to capture audio from the microphone. You can build it like this:

# Install SDL2
# On Debian-based Linux distributions:
sudo apt-get install libsdl2-dev

# On Fedora Linux:
sudo dnf install SDL2 SDL2-devel

# On macOS:
brew install sdl2

# Build the "talk" executable
make talk

# Run it
./talk -p Santa

GPT-2

To run this, you will need a ggml GPT-2 model: instructions

Alternatively, you can simply download the smallest ggml GPT-2 117M model (240 MB) like this:

wget --quiet --show-progress -O models/ggml-gpt-2-117M.bin https://huggingface.co/ggerganov/ggml/resolve/main/ggml-model-gpt-2-117M.bin
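
If wget is not available, the same download can be sketched in Python using only the standard library. The URL and destination path are copied from the command above; `fetch_model` is a hypothetical helper introduced here for illustration and is not part of whisper.cpp.

```python
import os
import urllib.request

# URL and destination copied from the wget command above.
MODEL_URL = "https://huggingface.co/ggerganov/ggml/resolve/main/ggml-model-gpt-2-117M.bin"
MODEL_PATH = "models/ggml-gpt-2-117M.bin"


def fetch_model(url: str = MODEL_URL, dest: str = MODEL_PATH) -> bool:
    """Download url to dest unless dest already exists.

    Returns True if a download happened, False if the file was already there.
    Hypothetical helper, not part of whisper.cpp.
    """
    if os.path.exists(dest):
        return False
    parent = os.path.dirname(dest)
    if parent:
        os.makedirs(parent, exist_ok=True)
    # Download under a temporary name first so an interrupted transfer
    # does not leave a truncated file at the final path.
    tmp = dest + ".part"
    urllib.request.urlretrieve(url, tmp)
    os.replace(tmp, dest)
    return True
```

Calling `fetch_model()` from the repository root should leave the model at `models/ggml-gpt-2-117M.bin`. A second call returns `False` without touching the network.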

TTS

For the best experience, this example needs a TTS tool to convert the generated text responses to voice. By default, the speak script is configured to use macOS's say, espeak, or the Windows SpeechSynthesizer. To use a different TTS engine, edit the speak script to suit your needs.
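
As a rough illustration of what a customized speak script has to do, here is a minimal Python sketch that picks a TTS command per platform. It is not the shipped speak script. The function names are hypothetical, and reading the text from a file is an assumption based on the commit title "pass text_to_speak as a file". Selecting the engine by platform name is likewise a simplification made for this sketch.

```python
import platform
import subprocess
from typing import List


def build_tts_command(system: str, text_path: str) -> List[str]:
    """Return the argv for speaking the contents of text_path.

    system is a value as returned by platform.system().
    Hypothetical helper; the real speak script may choose differently.
    """
    if system == "Darwin":
        # macOS: `say -f FILE` reads the text to speak from a file.
        return ["say", "-f", text_path]
    if system == "Windows":
        # Windows: drive System.Speech's SpeechSynthesizer via PowerShell.
        quoted = text_path.replace("'", "''")  # escape single quotes
        script = (
            "Add-Type -AssemblyName System.Speech; "
            "$s = New-Object System.Speech.Synthesis.SpeechSynthesizer; "
            f"$s.Speak([System.IO.File]::ReadAllText('{quoted}'))"
        )
        return ["powershell", "-NoProfile", "-Command", script]
    # Everything else: assume espeak is installed; `-f FILE` reads a file.
    return ["espeak", "-f", text_path]


def speak(text_path: str) -> int:
    """Run the platform's TTS command and return its exit code."""
    cmd = build_tts_command(platform.system(), text_path)
    return subprocess.run(cmd).returncode
```

To use another engine, only `build_tts_command` needs to change: return the argv of whichever tool should speak the file.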