Commit Graph

  • 49c33aa40d Fix typo in download-ggml-model.sh Michael Rienstra 2024-12-11 13:49:59 -0800
  • 262e865a70 ruby : Sync whisper.cpp and model download feature (#2617) KITAITI Makoto 2024-12-09 20:17:50 +0900
  • d4e47945e3 Use conditional get when get model files Kitaiti Makoto 2024-12-03 23:15:15 +0900
  • a0f3d8a831 Cosmetic fix Kitaiti Makoto 2024-12-01 08:26:47 +0900
  • 4559a70035 Don't care about no longer included file Kitaiti Makoto 2024-12-01 08:24:49 +0900
  • b8a5c85780 Remove unused function Kitaiti Makoto 2024-12-01 08:04:27 +0900
  • 0ed5b2399c Add headings to API section in README [skip ci] Kitaiti Makoto 2024-12-01 02:24:03 +0900
  • 9e50697dc1 Update documents Kitaiti Makoto 2024-12-01 02:10:01 +0900
  • d8d89d73e4 Add shorthand for pre-converted models Kitaiti Makoto 2024-12-01 02:09:44 +0900
  • 3fd13ae71f Make Whisper::Context#initialize accept Pathname Kitaiti Makoto 2024-11-29 22:09:55 +0900
  • d862e8359c Add test for Pathname of model Kitaiti Makoto 2024-11-29 22:05:52 +0900
  • b53b44e0ff Use C++17 Kitaiti Makoto 2024-12-06 23:05:42 +0900
  • ed733e85a1 scripts : update to new build system v1.7.3-pre Georgi Gerganov 2024-12-09 11:30:16 +0200
  • 5980b1ae77 devops : add cmake Georgi Gerganov 2024-12-08 23:09:26 +0200
  • 0415a66044 devops : update make commands Georgi Gerganov 2024-12-08 23:07:29 +0200
  • 7d134e3737 ggml : remove old files (skip) (#0) Georgi Gerganov 2024-12-08 23:04:26 +0200
  • 9df53b357e ggml : sync remnants (skip) (#0) Georgi Gerganov 2024-12-08 22:48:25 +0200
  • b2115b4d9b scripts : remove amx from sync Georgi Gerganov 2024-12-08 22:48:14 +0200
  • 0164427dd5 ci : disable freeBSD builds [no ci] Georgi Gerganov 2024-12-08 15:52:57 +0200
  • 627b11c78a readme : update build instructions Georgi Gerganov 2024-12-08 15:48:14 +0200
  • 472464453d ci : disable CUDA and Android builds Georgi Gerganov 2024-12-08 15:36:01 +0200
  • 11dddfbc9e ci : disable Obj-C build + fixes Georgi Gerganov 2024-12-08 13:35:35 +0200
  • 384e214cc7 make : shim cmake Georgi Gerganov 2024-12-06 15:34:53 +0200
  • f2c680f893 talk-llama : sync llama.cpp Georgi Gerganov 2024-12-05 14:30:33 +0200
  • fbe66da0e5 sync : ggml Georgi Gerganov 2024-12-05 14:29:18 +0200
  • a815940e0e ggml : add predefined list of CPU backend variants to build (llama/10626) Diego Devesa 2024-12-04 14:45:40 +0100
  • 904e307bce ggml-cpu : fix HWCAP2_I8MM value (llama/10646) Diego Devesa 2024-12-04 14:40:44 +0100
  • 491ec076b4 vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (llama/10642) Jeff Bolz 2024-12-04 01:28:59 -0600
  • 966433fdf2 SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (llama/10584) Nicolò Scipione 2024-12-04 02:29:20 +0100
  • 6f1ba9d82d Avoid using __fp16 on ARM with old nvcc (llama/10616) Frankie Robertson 2024-12-04 02:41:37 +0200
  • 015ecd0001 vulkan: optimize and reenable split_k (llama/10637) Jeff Bolz 2024-12-03 13:29:54 -0600
  • b7c64a4352 ggml: add GGML_SET Metal kernel + i32 CPU kernel (ggml/1037) PAB 2024-12-04 09:19:30 +0100
  • 7895d39508 ggml : add GGML_PAD_REFLECT_1D operation (ggml/1034) PAB 2024-12-03 20:20:04 +0100
  • 22616f00f9 files : remove make artifacts Georgi Gerganov 2024-12-03 20:29:32 +0200
  • 02c6fcbc2c common : fix compile warning Georgi Gerganov 2024-12-03 20:25:37 +0200
  • 3daeacad24 ggml : move AMX to the CPU backend (llama/10570) Diego Devesa 2024-12-03 20:22:12 +0200
  • 4d73962da4 metal : small-batch mat-mul kernels (llama/10581) Georgi Gerganov 2024-12-03 11:52:33 +0200
  • 068812650e SYCL: Fix and switch to GGML_LOG system instead of fprintf (llama/10579) Akarshan Biswas 2024-12-02 12:34:11 +0530
  • 4b7e059e15 ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_q4_0_4x4_q8_0() (llama/10567) Adrien Gallouët 2024-11-30 18:13:18 +0100
  • 30e35d7271 vulkan: Dynamic subgroup size support for Q6_K mat_vec (llama/10536) Eve 2024-11-30 07:00:02 +0000
  • 3623bd58f2 ggml : fix I8MM Q4_1 scaling factor conversion (llama/10562) Georgi Gerganov 2024-11-29 16:25:39 +0200
  • cb847c20a7 ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (llama/10580) Shupei Fan 2024-11-29 21:49:02 +0800
  • 964b154a2a sycl : offload of get_rows set to 0 (llama/10432) Alberto Cabrera Pérez 2024-11-29 12:38:45 +0000
  • d7c2a04bce sycl : Reroute permuted mul_mats through oneMKL (llama/10408) Alberto Cabrera Pérez 2024-11-29 09:49:43 +0000
  • 2bb4ca9cba CANN: RoPE operator optimization (llama/10563) Chenguang Li 2024-11-29 14:46:55 +0800
  • a753a82462 vulkan: get the first command buffer submitted sooner (llama/10499) Jeff Bolz 2024-11-29 00:18:02 -0600
  • 276b08d8f0 ggml : remove redundant copyright notice + update authors Georgi Gerganov 2024-11-28 20:46:40 +0200
  • 4ca1e72fe0 ggml : fix row condition for i8mm kernels (llama/10561) Georgi Gerganov 2024-11-28 14:56:37 +0200
  • 16a66f103f cmake : fix ARM feature detection (llama/10543) Georgi Gerganov 2024-11-28 14:56:23 +0200
  • 330273901f ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541) Shupei Fan 2024-11-28 20:52:03 +0800
  • 42099a9342 kompute : improve backend to pass test_backend_ops (llama/10542) Sergio López 2024-11-28 12:51:38 +0100
  • 90dd5fca9c CANN: Fix SOC_TYPE compile bug (llama/10519) leo-pony 2024-11-28 15:25:24 +0800
  • 2490f2a7f8 CANN: ROPE operator optimization (llama/10540) Chenguang Li 2024-11-28 14:24:46 +0800
  • 230e985633 Add some minimal optimizations for CDNA (llama/10498) uvos 2024-11-27 17:10:08 +0100
  • ae24083f23 metal : fix group_norm support condition (llama/0) Georgi Gerganov 2024-11-27 11:22:14 +0200
  • 6463e36369 vulkan: define all quant data structures in types.comp (llama/10440) Jeff Bolz 2024-11-27 01:32:54 -0600
  • b3301f7d82 vulkan: Handle GPUs with less shared memory (llama/10468) Jeff Bolz 2024-11-27 01:30:27 -0600
  • ab5d4d93ec vulkan: further optimize q5_k mul_mat_vec (llama/10479) Jeff Bolz 2024-11-27 01:21:59 -0600
  • 2d6e9dd723 vulkan: skip integer div/mod in get_offsets for batch_idx==0 (llama/10506) Jeff Bolz 2024-11-27 01:08:54 -0600
  • 2f16e51553 vulkan: optimize Q2_K and Q3_K mul_mat_vec (llama/10459) Jeff Bolz 2024-11-27 01:00:50 -0600
  • 0f0994902f mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (llama/10516) R0CKSTAR 2024-11-27 00:00:41 +0800
  • 5e1fcc1780 vulkan: fix group_norm (llama/10496) Jeff Bolz 2024-11-26 09:45:05 -0600
  • 48f421de23 cmake : enable warnings in llama (llama/10474) Georgi Gerganov 2024-11-26 14:18:08 +0200
  • e7afb2b991 ggml-cpu: cmake add arm64 cpu feature check for macos (llama/10487) Charles Xu 2024-11-26 12:37:05 +0100
  • 9a5ef7b169 CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454) Shanshan Shen 2024-11-26 18:08:37 +0800
  • 453cc0fcf1 CANN: RoPE and CANCAT operator optimization (llama/10488) Chenguang Li 2024-11-26 17:31:05 +0800
  • 78dfec6bc5 vulkan: Fix a vulkan-shaders-gen arugment parsing error (llama/10484) Junil Kim 2024-11-26 10:47:20 +0900
  • f6d518fc4c metal : enable mat-vec kernels for bs <= 4 (llama/10491) Georgi Gerganov 2024-11-25 21:49:31 +0200
  • ac33379a35 llama : accept a list of devices to use to offload a model (llama/10497) Diego Devesa 2024-11-25 19:30:06 +0100
  • 77e3e4a090 ggml : add support for dynamic loading of backends (llama/10469) Diego Devesa 2024-11-25 15:13:39 +0100
  • b840bb09be metal : minor code formatting Georgi Gerganov 2024-11-25 15:08:04 +0200
  • 8b1c1c30a7 ggml : do not use ARM features not included in the build (llama/10457) Diego Devesa 2024-11-23 14:41:12 +0100
  • 4b81335f75 CANN: Support Ascend310P to accelerate F32 and F16 Model (llama/10216) leo-pony 2024-11-22 14:07:20 +0800
  • 2a4b5c9d7e cuda : optimize argmax (llama/10441) Diego Devesa 2024-11-21 18:18:50 +0100
  • 04662748aa vulkan: predicate max operation in soft_max shaders/soft_max (llama/10437) Jeff Bolz 2024-11-20 13:47:36 -0600
  • a117279e13 vulkan: copy iq4_nl LUT into shared memory (llama/10409) Jeff Bolz 2024-11-20 01:40:18 -0600
  • bbb292ed38 vulkan: further optimize mul_mat_vec using larger loads (llama/10387) Jeff Bolz 2024-11-20 01:11:00 -0600
  • 95e8901e71 add cmake rvv support (llama/10411) haopeng 2024-11-20 04:10:31 +0800
  • 4af9626702 CUDA: remove unnecessary warp reduce in FA (ggml/1032) mahorozte 2024-12-03 21:11:43 +0800
  • c52d1035de feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019) PAB 2024-12-02 19:27:24 +0100
  • 5773a14980 metal : add GGML_OP_CONV_TRANSPOSE_1D kernels (ggml/1026) PAB 2024-11-28 09:25:06 +0100
  • 6939147c47 Do not include arm_neon.h when compiling CUDA code (ggml/1028) Frankie Robertson 2024-11-26 15:50:26 +0200
  • 98f9916c9f ggml-opt: fix data corruption (ggml/1022) Johannes Gäßler 2024-11-20 14:56:04 +0100
  • 280d2735bc ci : disable freeBSD builds [no ci] Georgi Gerganov 2024-12-08 15:52:57 +0200
  • 668930a989 readme : update build instructions Georgi Gerganov 2024-12-08 15:48:14 +0200
  • 762f63e2d0 ci : disable CUDA and Android builds Georgi Gerganov 2024-12-08 15:36:01 +0200
  • a5cd03a921 ci : disable Obj-C build + fixes Georgi Gerganov 2024-12-08 13:35:35 +0200
  • e3d545e5a6 Fix vulkan Makefile paths Lluís Batlle i Rossell 2024-12-07 12:27:48 +0100
  • ae769eae71 Use C++17 Kitaiti Makoto 2024-12-06 23:05:42 +0900
  • 729effe4cf make : shim cmake Georgi Gerganov 2024-12-06 15:34:53 +0200
  • 1a1fcd37cf talk-llama : sync llama.cpp Georgi Gerganov 2024-12-05 14:30:33 +0200
  • dfe6652b0d sync : ggml Georgi Gerganov 2024-12-05 14:29:18 +0200
  • dfddca02ec ggml : add predefined list of CPU backend variants to build (llama/10626) Diego Devesa 2024-12-04 14:45:40 +0100
  • 61aff48839 ggml-cpu : fix HWCAP2_I8MM value (llama/10646) Diego Devesa 2024-12-04 14:40:44 +0100
  • b311da34cf vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (llama/10642) Jeff Bolz 2024-12-04 01:28:59 -0600
  • 3085e2883a SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (llama/10584) Nicolò Scipione 2024-12-04 02:29:20 +0100
  • 9623ba19b2 Avoid using __fp16 on ARM with old nvcc (llama/10616) Frankie Robertson 2024-12-04 02:41:37 +0200
  • 03331b1de8 vulkan: optimize and reenable split_k (llama/10637) Jeff Bolz 2024-12-03 13:29:54 -0600
  • e20efac003 ggml: add GGML_SET Metal kernel + i32 CPU kernel (ggml/1037) PAB 2024-12-04 09:19:30 +0100
  • 40d5987bf3 ggml : add GGML_PAD_REFLECT_1D operation (ggml/1034) PAB 2024-12-03 20:20:04 +0100