Commit Graph

  • 1f14567ee6
    ggml : do not define GGML_USE_CUDA when building with GGML_BACKEND_DL (llama/11211) Radoslav Gerganov 2025-01-13 13:31:41 +0200
  • 618d94abb4
    Vulkan: Fix float16 use on devices without float16 support + fix subgroup_size_control validation error (llama/11161) 0cc4m 2025-01-10 06:39:33 +0100
  • 2d6f599774
    llama: add support for QRWKV6 model architecture (llama/11001) Molly Sophia 2025-01-10 09:58:08 +0800
  • fe7bb8849d
    SYCL: Refactor ggml_sycl_compute_forward (llama/11121) Akarshan Biswas 2025-01-10 05:43:03 +0530
  • 40aa3fa643
    fix: add missing msg in static_assert (llama/11143) hydai 2025-01-09 04:03:28 +0800
  • 3272320d98
    llamafile : ppc64le MMA INT8 implementation (llama/10912) amritahs-ibm 2025-01-08 16:24:19 +0530
  • e322e918a3
    Disable GL_KHR_cooperative_matrix Vulkan extension if not available. (llama/11117) Mathieu Baudier 2025-01-08 09:18:13 +0100
  • f2031c56c2
    fix: Vulkan shader gen binary path when Cross-compiling (llama/11096) ag2s20150909 2025-01-08 16:17:29 +0800
  • a48be79914
    GGUF: C++ refactor, backend support, misc fixes (llama/11030) Johannes Gäßler 2025-01-07 18:01:58 +0100
  • 6679500ba3
    ggml-backend : only offload from host buffers (fix) (llama/11124) Diego Devesa 2025-01-07 16:11:57 +0100
  • c324f37090
    ggml-backend : only offload from host buffers (llama/11120) Diego Devesa 2025-01-07 12:38:05 +0100
  • f0783516ac
    rpc : code cleanup (llama/11107) Radoslav Gerganov 2025-01-07 08:37:02 +0200
  • c52b2f6d50
    SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (llama/11087) Akarshan Biswas 2025-01-07 11:56:07 +0530
  • 9325f4af05
    CUDA: add BF16 support (llama/11093) Johannes Gäßler 2025-01-06 02:33:52 +0100
  • fbe4db4881
    Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074) 0cc4m 2025-01-04 21:09:59 +0100
  • aababf16c8
    Support for models with non-512-aligned tensors over RPC. (llama/11047) matt23654 2025-01-04 16:10:30 +0000
  • 95cd1d3276
    fix: Vulkan shader gen binary path (llama/11037) Gilad S. 2025-01-04 10:17:31 +0200
  • 19c147b26d
    ggml : allow loading backend with env variable (ggml/1059) Radoslav Gerganov 2025-01-05 09:50:37 +0200
  • 507e230f1e
    scripts : sync opencl, gguf Georgi Gerganov 2025-01-14 09:42:16 +0200
  • 3037f1d5ee
    Update whisper.objc xcode project with files that have been removed and add new files that were missing Corey Earwood 2025-01-13 22:19:12 -0700
  • 3915a1b1f4
    Disable GL_KHR_cooperative_matrix Vulkan extension if not available Pepijn de Vos 2025-01-13 22:01:56 +0100
  • eb68324c86
    whisper : fix gpu device selection (#2728) Georgi Gerganov 2025-01-13 13:11:37 +0200
  • c719c5be54
    whisper : fix gpu device selection Georgi Gerganov 2025-01-13 09:56:32 +0200
  • e940fbf283
    server : fix build (#2718) Georgi Gerganov 2025-01-13 08:57:33 +0200
  • 35d0e02c72
    talk-llama : sync llama.cpp (#2709) Georgi Gerganov 2025-01-13 08:55:48 +0200
  • 45d3faf961
    server : generate unique tmp filenames (#2718) NETZkultur GmbH 2025-01-13 07:55:21 +0100
  • f1fcab6eca
    Merge 7b7c9eb005 into 2ab2eb5110 Don Mahurin 2025-01-10 02:38:36 +0100
  • 6feb5f2690
    Use Unique Filenames for FFmpeg Conversion to Prevent File Overwrites NETZkultur GmbH 2025-01-09 15:48:55 +0100
  • 2ab2eb5110
    whisper : add whisper_full_get_segment_no_speech_prob_from_state (#2716) Sandro Hanea 2025-01-09 15:21:07 +0100
  • e18231285e
    Exposed whisper_full_get_segment_no_speech_prob_from_state in addition to context based retrieval Sandro Hanea 2025-01-09 11:46:24 +0000
  • b82d305282
    readme : add docker instructions (#2711) Jayant 2025-01-07 12:20:51 +0100
  • 744c2c431e
    Adding back docker instructions in v1.7.4 Jayant 2025-01-06 17:49:16 +0100
  • 9d2aafa153
    talk-llama : sync llama.cpp Georgi Gerganov 2025-01-06 15:25:33 +0200
  • 885e31368d
    docs: Fix main -> whisper-cli in download scripts (#2707) Adam Jones 2025-01-06 13:17:57 +0000
  • 8a9ad7844d
    release : v1.7.4 v1.7.4 Georgi Gerganov 2025-01-06 15:13:48 +0200
  • eb874b3a3c
    ci : cont Georgi Gerganov 2025-01-06 10:46:10 +0200
  • eb78e3a3f1
    ci : fix ubuntu runner names Georgi Gerganov 2025-01-06 09:29:10 +0200
  • d1a467f30a
    docs: Fix main -> whisper-cli in download scripts Adam Jones 2025-01-06 01:45:22 +0000
  • f99263e420
    Run vad_simple on entire pcmf32, not on the last step Tamotsu Takahashi 2025-01-05 08:47:22 +0900
  • ece3ff88f6
    cli : fix segfault on missing argument (#2700) Yusuf Redžić 2025-01-04 09:47:41 +0100
  • 9366544991
    ci : fix arm builds Georgi Gerganov 2025-01-03 16:24:02 +0200
  • 95583942ed
    sync : ggml Georgi Gerganov 2025-01-03 14:11:23 +0200
  • 2e93cb6a2f
    ggml : do not install metal source when embed library (ggml/1054) Georgi Gerganov 2025-01-03 14:11:20 +0200
  • de5cd60d1c
    metal : avoid uint (llama/11019) Georgi Gerganov 2025-01-03 11:26:14 +0200
  • 3fcba3e58b
    ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027) Srihari-mcw 2024-12-31 19:53:33 +0530
  • cea5f1c52f
    vulkan: optimize mul_mat for small values of N (llama/10991) Jeff Bolz 2024-12-30 11:27:11 -0600
  • 2112462db4
    vulkan: im2col and matmul optimizations for stable diffusion (llama/10942) Jeff Bolz 2024-12-29 03:16:34 -0600
  • fc84ecd445
    vulkan: Use push constant offset to handle misaligned descriptors (llama/10987) Jeff Bolz 2024-12-29 02:35:11 -0600
  • 8de1e99907
    vulkan: multi-row k quants (llama/10846) Eve 2024-12-26 10:54:44 -0500
  • 499af9294a
    examples, ggml : fix GCC compiler warnings (llama/10983) Peter 2024-12-27 00:59:11 +1100
  • bcf937c216
    ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714) Djip007 2024-12-24 18:54:49 +0100
  • b8d90953d7
    ggml : use wstring for backend search paths (llama/10960) Diego Devesa 2024-12-24 04:05:27 +0100
  • 60a422147b
    ggml : fix arm enabled features check (llama/10961) Diego Devesa 2024-12-24 04:05:17 +0100
  • 3387415bad
    ggml : fix const usage in SSE path (llama/10962) Diego Devesa 2024-12-23 20:25:52 +0100
  • 536ca3ec89
    ggml : fix run-time on FreeBSD in get_executable_path() (llama/10948) yuri@FreeBSD 2024-12-22 16:20:11 -0800
  • a4bb983190
    vulkan: build fixes for 32b (llama/10927) Jeff Bolz 2024-12-22 03:44:01 -0600
  • 39c205f555
    vulkan: optimize coopmat2 dequant functions (llama/10855) Jeff Bolz 2024-12-21 01:04:45 -0600
  • 6d502f33dc
    ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874) Adrien Gallouët 2024-12-21 00:33:37 +0100
  • 5ea27d089d
    SYCL: Migrate away from deprecated ggml_tensor->backend (llama/10840) Akarshan Biswas 2024-12-20 21:01:28 +0530
  • 1462d92588
    ggml : add test for SVE and disable when it fails (llama/10906) Diego Devesa 2024-12-20 13:31:28 +0100
  • 7ba1a41f47
    ggml: fix arm build with gcc (llama/10895) Adrien Gallouët 2024-12-19 14:20:41 +0100
  • 5ea088636f
    ggml : fix arm build (llama/10890) Diego Devesa 2024-12-18 23:21:42 +0100
  • f32ddb3b1c
    tts : add OuteTTS support (llama/10784) Georgi Gerganov 2024-12-18 19:27:21 +0200
  • 79b75ece03
    tests: add tests for GGUF (llama/10830) Johannes Gäßler 2024-12-17 19:09:35 +0100
  • 6348d73e55
    ggml : improve inputs log sched_print_assignments (ggml/1053) Daniel Bevenius 2024-12-19 03:50:12 +0100
  • 5e60c6de3b
    ci : fix arm builds Georgi Gerganov 2025-01-03 16:24:02 +0200
  • 09d49febbf
    cli : fix segfault on missing argument Yusuf Redzic 2025-01-03 17:48:43 +0100
  • 1deb9a6151
    sync : ggml Georgi Gerganov 2025-01-03 14:11:23 +0200
  • 0024db855b
    ggml : do not install metal source when embed library (ggml/1054) Georgi Gerganov 2025-01-03 14:11:20 +0200
  • 6a0441b002
    metal : avoid uint (llama/11019) Georgi Gerganov 2025-01-03 11:26:14 +0200
  • 11d52f5d35
    ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027) Srihari-mcw 2024-12-31 19:53:33 +0530
  • b9ca755a59
    vulkan: optimize mul_mat for small values of N (llama/10991) Jeff Bolz 2024-12-30 11:27:11 -0600
  • c43ba3787f
    vulkan: im2col and matmul optimizations for stable diffusion (llama/10942) Jeff Bolz 2024-12-29 03:16:34 -0600
  • 980d41ec7a
    vulkan: Use push constant offset to handle misaligned descriptors (llama/10987) Jeff Bolz 2024-12-29 02:35:11 -0600
  • aa0887e8af
    vulkan: multi-row k quants (llama/10846) Eve 2024-12-26 10:54:44 -0500
  • 55a5cf35a2
    examples, ggml : fix GCC compiler warnings (llama/10983) Peter 2024-12-27 00:59:11 +1100
  • fbaadc216d
    ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714) Djip007 2024-12-24 18:54:49 +0100
  • 3edc4b0db1
    ggml : use wstring for backend search paths (llama/10960) Diego Devesa 2024-12-24 04:05:27 +0100
  • 6d27ca5bb7
    ggml : fix arm enabled features check (llama/10961) Diego Devesa 2024-12-24 04:05:17 +0100
  • ac6af9e766
    ggml : fix const usage in SSE path (llama/10962) Diego Devesa 2024-12-23 20:25:52 +0100
  • b1151f92a0
    ggml : fix run-time on FreeBSD in get_executable_path() (llama/10948) yuri@FreeBSD 2024-12-22 16:20:11 -0800
  • 6c01a1eb4c
    vulkan: build fixes for 32b (llama/10927) Jeff Bolz 2024-12-22 03:44:01 -0600
  • 0914a27c53
    vulkan: optimize coopmat2 dequant functions (llama/10855) Jeff Bolz 2024-12-21 01:04:45 -0600
  • 18bfd3181f
    ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874) Adrien Gallouët 2024-12-21 00:33:37 +0100
  • f86cf2b1e1
    SYCL: Migrate away from deprecated ggml_tensor->backend (llama/10840) Akarshan Biswas 2024-12-20 21:01:28 +0530
  • 011ca37a19
    ggml : add test for SVE and disable when it fails (llama/10906) Diego Devesa 2024-12-20 13:31:28 +0100
  • 58d1a1b4b9
    ggml: fix arm build with gcc (llama/10895) Adrien Gallouët 2024-12-19 14:20:41 +0100
  • 7927fce84a
    ggml : fix arm build (llama/10890) Diego Devesa 2024-12-18 23:21:42 +0100
  • 3aa63a8c2f
    tts : add OuteTTS support (llama/10784) Georgi Gerganov 2024-12-18 19:27:21 +0200
  • 0d4d69a9cc
    tests: add tests for GGUF (llama/10830) Johannes Gäßler 2024-12-17 19:09:35 +0100
  • a371a941fb
    ggml : improve inputs log sched_print_assignments (ggml/1053) Daniel Bevenius 2024-12-19 03:50:12 +0100
  • 0a84581f20
    Make stream more test-friendly Tamotsu Takahashi 2025-01-03 15:28:47 +0900
  • fb36a1538a
    readme : fix real-time audio input example build instructions (#2692) Samuel Durante 2025-01-02 07:05:38 -0300
  • c81b8b910b
    objc : rename ggml-cpu-aarch64.c to .cpp (#2687) Alter 2025-01-02 10:05:09 +0000
  • 85b60f31d0
    docs : replace Core ML with OpenVINO (#2686) Konosuke Sakai 2025-01-02 19:03:02 +0900
  • 425d3add59
    Fix windows build Tamotsu Takahashi 2025-01-02 13:53:26 +0900
  • 17c7600416
    Fix inconsistency of ifdef Tamotsu Takahashi 2025-01-02 13:35:06 +0900
  • 61222da541
    Fix windows build (include fcntl.h) Tamotsu Takahashi 2025-01-02 12:44:39 +0900
  • 03b25dd7f3
    Remove unused n_new_line Tamotsu Takahashi 2025-01-02 12:40:11 +0900
  • 75099f9f87
    Fix armv7-linux build Tamotsu Takahashi 2025-01-02 12:32:01 +0900