Commit Graph

  • 7ac2f17fac cuda : only use native when supported by cmake (llama/10389) Diego Devesa 2024-11-18 18:43:40 +0100
  • 48862c7b27 vulkan: remove use of null initializer (llama/10372) Jeff Bolz 2024-11-18 08:28:42 -0600
  • 44f7d9f4e3 metal : fix offset integer overflows in im2col (ggml/1015) Plamen Minev 2024-11-18 15:02:27 +0200
  • fd12302587 Vulkan: Fix device info output format specifiers (llama/10366) 0cc4m 2024-11-18 11:02:43 +0100
  • f80bef4630 metal : add GGML_UNARY_OP_ELU kernel (ggml/1018) PAB 2024-11-18 10:02:49 +0100
  • 161b443514 CUDA: fix MMV kernel being used for FP16 src1 (llama/10357) Johannes Gäßler 2024-11-17 23:20:42 +0100
  • ef7fbe1c66 CMake: fix typo in comment [no ci] (llama/10360) Johannes Gäßler 2024-11-17 12:59:38 +0100
  • 0879d3599e llama : only use default buffer types for the KV cache (llama/10358) Diego Devesa 2024-11-17 12:25:45 +0100
  • 2a444dc5bd metal : refactor kernel args into structs (llama/10238) Georgi Gerganov 2024-11-17 11:23:01 +0200
  • 45cf1634dc ggml : fix undefined reference to 'getcpu' (llama/10354) FirstTimeEZ 2024-11-17 21:39:22 +1300
  • dcb2922d1d CUDA: remove DMMV, consolidate F16 mult mat vec (llama/10318) Johannes Gäßler 2024-11-17 09:09:55 +0100
  • 3c5c751174 CMake: default to -arch=native for CUDA build (llama/10320) Johannes Gäßler 2024-11-17 09:06:34 +0100
  • 24ad19d0e9 ggml : fix possible buffer use after free in sched reserve (llama/9930) Diego Devesa 2024-11-17 07:31:17 +0100
  • bd574b05af ggml : inttypes.h -> cinttypes (llama/0) Georgi Gerganov 2024-11-16 23:40:39 +0200
  • 7e0eafcb1e ggml : adapt AMX to tensor->grad removal (llama/0) Georgi Gerganov 2024-11-16 21:38:01 +0200
  • 75670ae673 ggml : fix compile warnings (llama/0) Georgi Gerganov 2024-11-16 21:32:41 +0200
  • d4fcdf602b llamafile : fix include path (llama/0) Georgi Gerganov 2024-11-16 17:58:56 +0200
  • 1bebb1a116 vulkan: Optimize some mat-vec mul quant shaders (llama/10296) Jeff Bolz 2024-11-16 00:26:57 -0600
  • ee437cde59 ggml : optimize Q4_0 into Q4_0_X_Y repack (llama/10324) Dan Johansson 2024-11-16 01:53:37 +0100
  • c1506d38cf Make updates to fix issues with clang-cl builds while using AVX512 flags (llama/10314) Srihari-mcw 2024-11-16 02:57:00 +0530
  • c9541741e6 ggml: new optimization interface (ggml/988) Johannes Gäßler 2024-11-16 13:49:35 +0100
  • 6a55015dc4 ggml : remove duplicated sources from the last sync (ggml/1017) Georgi Gerganov 2024-11-15 23:52:31 +0200
  • 7e86030d4d ggml : fix some build issues slaren 2024-11-15 20:20:54 +0100
  • 401fbea326 sync : leftovers (ggml/0) Georgi Gerganov 2024-11-15 21:43:41 +0200
  • 44d1cbdfe9 cmake : restore CMakeLists.txt (llama/10256) Georgi Gerganov 2024-11-15 21:35:51 +0200
  • 3216efef2e AVX BF16 and single scale quant optimizations (llama/10212) Eve 2024-11-15 11:47:58 +0000
  • 2c0484ebf7 sycl: Use syclcompat::dp4a (llama/10267) Romain Biessy 2024-11-15 04:09:12 +0100
  • 3298916e5e backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921) Charles Xu 2024-11-15 01:28:50 +0100
  • 746bf2596f ggml : build backends as libraries (llama/10256) Diego Devesa 2024-11-14 18:04:35 +0100
  • 5f7e094ccb scripts : update sync Georgi Gerganov 2024-11-19 18:59:18 +0200
  • e6114173b8 whisper : use backend registry (#0) Georgi Gerganov 2024-11-20 15:32:34 +0200
  • 85ff4f974e Fix crash in ggml_vk_print_gpu_info Juliusz Chroboczek 2024-11-20 16:35:35 +0100
  • c800966378 ggml/sched : do not skip views in pre-assignments slaren 2024-11-20 13:25:08 +0100
  • 8c24c64924 whisper : adapt to new ggml (wip) Georgi Gerganov 2024-11-19 19:09:07 +0200
  • 4e1f516ecc talk-llama : sync llama.cpp Georgi Gerganov 2024-11-19 19:08:57 +0200
  • 0eddc9fcbc sync : ggml Georgi Gerganov 2024-11-19 19:04:21 +0200
  • 52799f9082 ggml : sync resolve (skip) (#0) Georgi Gerganov 2024-11-19 19:03:47 +0200
  • bfaf1fc76f Add required ggml-base and backend libs to cmake pkg (llama/10407) bandoti 2024-11-19 12:10:30 -0400
  • 166237d07e cuda : fix CUDA_FLAGS not being applied (llama/10403) Diego Devesa 2024-11-19 14:29:38 +0100
  • d2aaf9ecfc sycl : Add option to set the SYCL architecture for all targets (llama/10266) Romain Biessy 2024-11-19 09:02:23 +0100
  • 29894ef822 vulkan: Optimize soft_max (llama/10301) Jeff Bolz 2024-11-19 01:25:17 -0600
  • 8d6e30fb61 sycl: Revert MUL_MAT_OP support changes (llama/10385) Alberto Cabrera Pérez 2024-11-19 00:50:04 +0000
  • 761d310e78 cuda : only use native when supported by cmake (llama/10389) Diego Devesa 2024-11-18 18:43:40 +0100
  • c4f4639466 vulkan: remove use of null initializer (llama/10372) Jeff Bolz 2024-11-18 08:28:42 -0600
  • c157f624e2 metal : fix offset integer overflows in im2col (ggml/1015) Plamen Minev 2024-11-18 15:02:27 +0200
  • 748d633638 Vulkan: Fix device info output format specifiers (llama/10366) 0cc4m 2024-11-18 11:02:43 +0100
  • 937684c822 metal : add GGML_UNARY_OP_ELU kernel (ggml/1018) PAB 2024-11-18 10:02:49 +0100
  • 58b5fc45b9 CUDA: fix MMV kernel being used for FP16 src1 (llama/10357) Johannes Gäßler 2024-11-17 23:20:42 +0100
  • fcd8ea6aff CMake: fix typo in comment [no ci] (llama/10360) Johannes Gäßler 2024-11-17 12:59:38 +0100
  • 6b4de57e65 llama : only use default buffer types for the KV cache (llama/10358) Diego Devesa 2024-11-17 12:25:45 +0100
  • dca00d8374 metal : refactor kernel args into structs (llama/10238) Georgi Gerganov 2024-11-17 11:23:01 +0200
  • a901ba0716 ggml : fix undefined reference to 'getcpu' (llama/10354) FirstTimeEZ 2024-11-17 21:39:22 +1300
  • 8bd8688888 CUDA: remove DMMV, consolidate F16 mult mat vec (llama/10318) Johannes Gäßler 2024-11-17 09:09:55 +0100
  • 77ea626d26 CMake: default to -arch=native for CUDA build (llama/10320) Johannes Gäßler 2024-11-17 09:06:34 +0100
  • c96434f2b3 ggml : fix possible buffer use after free in sched reserve (llama/9930) Diego Devesa 2024-11-17 07:31:17 +0100
  • 3f1a78d6f8 ggml : inttypes.h -> cinttypes (llama/0) Georgi Gerganov 2024-11-16 23:40:39 +0200
  • 600728ea21 ggml : adapt AMX to tensor->grad removal (llama/0) Georgi Gerganov 2024-11-16 21:38:01 +0200
  • e726307095 ggml : fix compile warnings (llama/0) Georgi Gerganov 2024-11-16 21:32:41 +0200
  • 7caa6b2e83 llamafile : fix include path (llama/0) Georgi Gerganov 2024-11-16 17:58:56 +0200
  • 68b198b438 vulkan: Optimize some mat-vec mul quant shaders (llama/10296) Jeff Bolz 2024-11-16 00:26:57 -0600
  • 49ca4814be ggml : optimize Q4_0 into Q4_0_X_Y repack (llama/10324) Dan Johansson 2024-11-16 01:53:37 +0100
  • 4b8ddfbda7 Make updates to fix issues with clang-cl builds while using AVX512 flags (llama/10314) Srihari-mcw 2024-11-16 02:57:00 +0530
  • adf81dc329 ggml: new optimization interface (ggml/988) Johannes Gäßler 2024-11-16 13:49:35 +0100
  • f33c7ea0c5 ggml : remove duplicated sources from the last sync (ggml/1017) Georgi Gerganov 2024-11-15 23:52:31 +0200
  • 83c77397e4 ggml : fix some build issues slaren 2024-11-15 20:20:54 +0100
  • 8dffd6444c sync : leftovers (ggml/0) Georgi Gerganov 2024-11-15 21:43:41 +0200
  • 1d49a2e7a2 cmake : restore CMakeLists.txt (llama/10256) Georgi Gerganov 2024-11-15 21:35:51 +0200
  • 0df66d6586 AVX BF16 and single scale quant optimizations (llama/10212) Eve 2024-11-15 11:47:58 +0000
  • 04d1bae6d4 sycl: Use syclcompat::dp4a (llama/10267) Romain Biessy 2024-11-15 04:09:12 +0100
  • 41c90650a2 backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921) Charles Xu 2024-11-15 01:28:50 +0100
  • ce58be7e79 ggml : build backends as libraries (llama/10256) Diego Devesa 2024-11-14 18:04:35 +0100
  • 06c86c03d8 scripts : update sync Georgi Gerganov 2024-11-19 18:59:18 +0200
  • 6266a9f9e5 release : v1.7.2 v1.7.2 Georgi Gerganov 2024-11-19 18:54:22 +0200
  • d24f981fb2 sycl: fix example build (#2570) Stefan Sydow 2024-11-18 13:57:23 +0100
  • 4187c6ca19 sycl: fix example build Stefan Sydow 2024-11-08 22:13:29 +0100
  • c5b9b546b8 docs: Update README.md for whisper.objc app Tomer Schlesinger 2024-11-17 20:41:48 +0200
  • 01d3bd7d5c ci : use local ggml in Android build (#2567) Georgi Gerganov 2024-11-16 20:45:41 +0200
  • 511579cc15 ci : use local ggml gg/ci-fix-android Georgi Gerganov 2024-11-16 20:31:57 +0200
  • bb12cd9b77 ggml : tmp workaround for whisper.cpp (skip) (#2565) Georgi Gerganov 2024-11-16 20:19:02 +0200
  • f02b40bcb4 update : readme v1.7.2-pre Georgi Gerganov 2024-11-15 16:00:10 +0200
  • 83ac2842bd scripts : fix sync path Georgi Gerganov 2024-11-15 15:24:09 +0200
  • c4e95fb74d whisper.swiftui : switch Mac dest to Mac (Designed for iPad) (#2562) Jhen-Jie Hong 2024-11-15 21:21:53 +0800
  • e23721f3fb cmake : fix ppc64 check (#0) Georgi Gerganov 2024-11-15 09:04:34 +0200
  • c0a9f8ef85 whisper : include ggml-cpu.h (#0) Georgi Gerganov 2024-11-15 11:01:47 +0200
  • 6477b84eb6 build : fixes Georgi Gerganov 2024-11-15 09:07:53 +0200
  • 24d706774d talk-llama : sync llama.cpp Georgi Gerganov 2024-11-15 08:41:06 +0200
  • 5089ab2d6a whisper : fix build (#0) Georgi Gerganov 2024-11-15 08:40:47 +0200
  • bdbb906817 sync : ggml Georgi Gerganov 2024-11-15 08:40:34 +0200
  • fa2ebd336e sycl : Fixes to broken builds and test-backend-ops (llama/10257) Alberto Cabrera Pérez 2024-11-13 09:40:57 +0000
  • 21b01a21b6 vulkan: Optimize contiguous copies (llama/10254) Jeff Bolz 2024-11-13 00:58:57 -0600
  • b54ce5edc5 vulkan: Throttle the number of shader compiles during the build step. (llama/10222) Jeff Bolz 2024-11-11 11:13:51 -0600
  • 26a31b78e9 metal : more precise Q*K in FA vec kernel (llama/10247) Georgi Gerganov 2024-11-11 08:39:13 +0200
  • 14d13c5f9f vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (llama/10226) Jeff Bolz 2024-11-10 05:37:56 -0600
  • 5e110c2eb5 metal : reorder write loop in mul mat kernel + style (llama/10231) Georgi Gerganov 2024-11-09 11:53:13 +0200
  • 4a9926d521 metal : fix build and some more comments (llama/10229) Georgi Gerganov 2024-11-09 11:53:02 +0200
  • ae3c5642d0 metal : fix F32 accumulation in FA vec kernel (llama/10232) Georgi Gerganov 2024-11-09 11:52:45 +0200
  • e287a3b627 metal : hide debug messages from normal log Georgi Gerganov 2024-11-09 11:21:49 +0200
  • b890243690 ggml: fix zero division in ‘dne’ calculation in CUDA COUNT_EQUAL operator when ‘ne’ is small (#10213) SXX 2024-11-09 15:35:46 +0800
  • b7b38f7d68 ggml : optimize llamafile cpu matrix multiplication for ppc64le (llama/10156) amritahs-ibm 2024-11-09 12:47:50 +0530
  • 9f67aab211 metal : opt-in compile flag for BF16 (llama/10218) Georgi Gerganov 2024-11-08 21:59:46 +0200