mirror of
https://github.com/ggerganov/whisper.cpp.git
synced 2025-02-02 19:39:44 +01:00
Commit Graph
Select branches
arghh
avx512
batched
bench-memcpy
chess
coreml-with-state
cuda-cublas-opts
diarization
distil-support
experiment/model-compression
fa-decoder
feature/debug-gradle-signing
fix-bench
fix-coreml-ane
fix-vzip
gg/alloc-enc-results
gg/chess
gg/ci-cuda-fix
gg/ci-fix-android
gg/ci-fix-windows
gg/cuda-fix-mmvq
gg/cuda-no-async
gg/disable-cuda-graphs
gg/fix-external-encoder
gg/hipblas-fix
gg/objc
gg/prompt-tokens
gg/reduce-ctx-use
gg/wchess
ggml-backend
ggml-backend-no-sched
ggml-conv
grammar-debug
guided
java-bindings
large-v3
llama-podcast
macros-cvt-fp16
master
metal
metal-and-alloc
nvblas
parallel-states
quantize-encoder
stream
talk.llama-coreml
threads
timing
try-fix-abort
word-ts-2
#1001
#1002
#1003
#1010
#1012
#1015
#102
#1021
#1021
#1024
#1027
#1029
#1031
#1032
#1034
#1037
#1041
#1042
#1045
#1046
#1049
#1054
#1058
#1060
#1062
#1064
#1067
#107
#1074
#1074
#1077
#1081
#1086
#1086
#1092
#1097
#1097
#110
#1101
#111
#1110
#1111
#1112
#1113
#1114
#1115
#1118
#1118
#1120
#1124
#1128
#1129
#1130
#1131
#1134
#1136
#1137
#114
#1142
#1143
#1144
#1147
#1148
#115
#1154
#116
#1160
#1162
#1164
#1164
#1173
#1174
#1196
#1204
#1205
#1209
#121
#1210
#1211
#1212
#1214
#1216
#1217
#1218
#1220
#1224
#1227
#1228
#1229
#123
#1231
#1235
#1238
#124
#1243
#1247
#1250
#1251
#1253
#1254
#1255
#1261
#1261
#1263
#1264
#1265
#1267
#127
#127
#1270
#1275
#128
#1286
#1290
#1293
#1294
#1298
#130
#130
#1303
#1304
#1305
#1306
#131
#1310
#1313
#1317
#1330
#1334
#1335
#1345
#1349
#135
#1350
#1352
#1356
#1358
#136
#1362
#1364
#1368
#1370
#1375
#1375
#1380
#1381
#1381
#1382
#1389
#1400
#1404
#141
#1415
#1417
#1418
#1418
#1420
#1422
#1424
#143
#1432
#1434
#1440
#1441
#1442
#1444
#1445
#1452
#1455
#1455
#1456
#1457
#1458
#1459
#1462
#1466
#1467
#147
#1472
#1473
#1474
#1475
#1478
#1478
#1479
#1484
#1485
#1486
#1487
#1492
#1493
#1499
#1499
#150
#1500
#1500
#1501
#1505
#1519
#1521
#1522
#1523
#1524
#1524
#1529
#1530
#1533
#1534
#1535
#1539
#1541
#1544
#1545
#1546
#1547
#1548
#1549
#1549
#155
#1551
#1554
#1559
#1559
#1560
#1561
#1563
#1563
#1565
#1567
#1568
#1574
#1575
#1576
#1578
#1582
#1583
#1586
#1588
#1589
#1595
#160
#1602
#1604
#1604
#1605
#1606
#1607
#1615
#1617
#1627
#1627
#163
#1633
#1649
#1649
#1650
#1651
#1655
#1658
#1667
#1669
#1672
#1673
#1674
#1675
#1677
#1679
#1679
#1681
#1691
#1692
#1694
#1695
#170
#1701
#1703
#1704
#1713
#1714
#1716
#1717
#1725
#1727
#1728
#1729
#1735
#174
#1740
#1741
#1744
#1747
#1749
#175
#1750
#1753
#1754
#1755
#1758
#1763
#1764
#1765
#1768
#1768
#1772
#1774
#1778
#1781
#1785
#179
#1791
#1791
#1792
#1802
#1806
#1809
#1812
#1813
#1819
#1823
#1823
#183
#1833
#1833
#1838
#1839
#1840
#1841
#1841
#1842
#1850
#1854
#1854
#1857
#1859
#1860
#1861
#1863
#1865
#1871
#1872
#1874
#1878
#1888
#1889
#1890
#1891
#1895
#1897
#19
#1902
#1913
#1913
#1917
#1924
#1924
#1925
#1926
#1928
#1929
#193
#1932
#1933
#1938
#194
#1942
#1943
#1944
#1945
#1947
#195
#1952
#1952
#1953
#1964
#1965
#1966
#1969
#1969
#1970
#1973
#1973
#1978
#1978
#1980
#1981
#1982
#1983
#1990
#1990
#1994
#1997
#1998
#20
#2000
#2001
#2004
#2005
#2005
#201
#2012
#2019
#2020
#2024
#2025
#2026
#203
#203
#2043
#2044
#2045
#2048
#2049
#2054
#2058
#2063
#2068
#2068
#2069
#2070
#2071
#2071
#2072
#2073
#2075
#2075
#2080
#2086
#2088
#2090
#2094
#2095
#2095
#21
#2100
#2102
#2108
#2115
#2119
#2121
#2123
#2127
#2127
#2128
#2129
#2133
#2138
#2142
#2152
#2153
#2154
#2166
#2170
#2181
#2182
#2184
#2184
#2189
#2194
#2196
#2198
#2206
#2208
#2217
#222
#2220
#2227
#2231
#2232
#2234
#2235
#2236
#2237
#2238
#2239
#224
#2240
#2242
#2254
#2254
#2256
#2261
#2264
#2266
#2267
#2270
#2272
#2272
#2279
#2279
#228
#2288
#229
#2290
#2291
#2291
#2294
#2299
#23
#230
#2302
#231
#2311
#2324
#2330
#2330
#2336
#2339
#2339
#2342
#2343
#2346
#2350
#2358
#2360
#2367
#2369
#2369
#2376
#2376
#2382
#2383
#2384
#2384
#2386
#2387
#239
#2391
#2393
#2396
#24
#2401
#2406
#2406
#2407
#2410
#2414
#2416
#2417
#2419
#2424
#2425
#2427
#2429
#2431
#2432
#2432
#2433
#2440
#2443
#2444
#2449
#245
#2451
#2455
#2464
#2475
#2477
#2481
#2484
#2485
#2488
#2489
#2495
#2505
#2506
#2511
#2515
#2516
#2517
#2518
#2519
#252
#2523
#2525
#2528
#2529
#253
#2534
#254
#2543
#2546
#2547
#2548
#2549
#2550
#2551
#2555
#2560
#2560
#2561
#2562
#2567
#2569
#2569
#257
#2570
#2573
#2574
#2576
#2576
#2577
#2577
#2579
#2579
#2580
#2585
#2589
#2593
#2593
#260
#2604
#2608
#2611
#2613
#2617
#2623
#2624
#2625
#2629
#2633
#2634
#2634
#2635
#2637
#2638
#2639
#2641
#2642
#2643
#2648
#2649
#2653
#2654
#2656
#2659
#2663
#2664
#2670
#2674
#2676
#2683
#2684
#2684
#2686
#2687
#2690
#2690
#2691
#2691
#2692
#2693
#2694
#2694
#2699
#27
#2700
#2707
#2709
#271
#2711
#2716
#2718
#2728
#273
#2734
#2736
#2737
#274
#2745
#2749
#2756
#2756
#2759
#2759
#2760
#2760
#2769
#2769
#277
#2770
#2770
#28
#282
#284
#284
#285
#286
#287
#288
#29
#291
#294
#296
#298
#299
#3
#301
#302
#306
#308
#31
#317
#318
#319
#320
#322
#323
#324
#331
#336
#34
#340
#343
#343
#345
#346
#349
#350
#351
#353
#357
#359
#36
#362
#365
#366
#368
#369
#379
#38
#381
#383
#384
#387
#388
#390
#391
#398
#404
#409
#41
#415
#42
#424
#425
#43
#431
#435
#436
#439
#443
#444
#446
#451
#453
#454
#454
#455
#456
#459
#461
#462
#468
#473
#474
#476
#482
#484
#485
#486
#494
#495
#497
#500
#501
#502
#502
#503
#506
#515
#520
#523
#532
#534
#537
#538
#540
#542
#552
#563
#566
#569
#572
#576
#58
#583
#60
#600
#605
#613
#613
#615
#619
#624
#624
#626
#627
#628
#629
#629
#638
#640
#642
#645
#648
#649
#650
#650
#659
#659
#664
#668
#67
#677
#682
#685
#686
#687
#688
#697
#70
#704
#706
#710
#711
#712
#716
#718
#72
#720
#721
#725
#728
#733
#737
#739
#740
#755
#759
#760
#763
#764
#768
#77
#776
#78
#798
#81
#810
#811
#812
#815
#816
#832
#833
#834
#835
#836
#837
#842
#845
#853
#854
#862
#863
#867
#87
#871
#871
#874
#875
#883
#885
#890
#891
#891
#893
#899
#902
#908
#910
#915
#926
#927
#931
#935
#939
#939
#94
#944
#95
#956
#964
#968
#968
#971
#971
#972
#995
0.0.5-3
0.0.6-1
1.0.3
1.0.4
1.1.0
1.4.1-1
1.4.1-2
1.5.2
v1.0.4
v1.1.0
v1.1.1
v1.2.0
v1.2.1
v1.3.0
v1.4.0
v1.4.1
v1.4.2
v1.4.3
v1.5.0
v1.5.1
v1.5.2
v1.5.3
v1.5.4
v1.5.5
v1.6.0
v1.6.1
v1.6.2
v1.7.0
v1.7.1
v1.7.2
v1.7.2-pre
v1.7.3
v1.7.3-pre
v1.7.4
v1.7.4-pre-0
v1.7.4-pre-1
-
49c33aa40d
2024-12-11 13:49:59 -0800 -
262e865a70
2024-12-09 20:17:50 +0900 -
d4e47945e3
Use conditional get when get model files
Kitaiti Makoto
2024-12-03 23:15:15 +0900 -
a0f3d8a831
Cosmetic fix
Kitaiti Makoto
2024-12-01 08:26:47 +0900 -
4559a70035
Don't care about no longer included file
Kitaiti Makoto
2024-12-01 08:24:49 +0900 -
b8a5c85780
Remove unused function
Kitaiti Makoto
2024-12-01 08:04:27 +0900 -
0ed5b2399c
Add headings to API section in README [skip ci]
Kitaiti Makoto
2024-12-01 02:24:03 +0900 -
9e50697dc1
Update documents
Kitaiti Makoto
2024-12-01 02:10:01 +0900 -
d8d89d73e4
Add shorthand for pre-converted models
Kitaiti Makoto
2024-12-01 02:09:44 +0900 -
3fd13ae71f
Make Whisper::Context#initialize accept Pathname
Kitaiti Makoto
2024-11-29 22:09:55 +0900 -
d862e8359c
Add test for Pathname of model
Kitaiti Makoto
2024-11-29 22:05:52 +0900 -
b53b44e0ff
Use C++17
Kitaiti Makoto
2024-12-06 23:05:42 +0900 -
ed733e85a1
2024-12-09 11:30:16 +0200 -
5980b1ae77
2024-12-08 23:09:26 +0200 -
0415a66044
2024-12-08 23:07:29 +0200 -
7d134e3737
2024-12-08 23:04:26 +0200 -
9df53b357e
2024-12-08 22:48:25 +0200 -
b2115b4d9b
2024-12-08 22:48:14 +0200 -
0164427dd5
ci : disable freeBSD builds [no ci]
Georgi Gerganov
2024-12-08 15:52:57 +0200 -
627b11c78a
readme : update build instructions
Georgi Gerganov
2024-12-08 15:48:14 +0200 -
472464453d
ci : disable CUDA and Android builds
Georgi Gerganov
2024-12-08 15:36:01 +0200 -
11dddfbc9e
ci : disable Obj-C build + fixes
Georgi Gerganov
2024-12-08 13:35:35 +0200 -
384e214cc7
make : shim cmake
Georgi Gerganov
2024-12-06 15:34:53 +0200 -
f2c680f893
talk-llama : sync llama.cpp
Georgi Gerganov
2024-12-05 14:30:33 +0200 -
fbe66da0e5
sync : ggml
Georgi Gerganov
2024-12-05 14:29:18 +0200 -
a815940e0e
ggml : add predefined list of CPU backend variants to build (llama/10626)
Diego Devesa
2024-12-04 14:45:40 +0100 -
904e307bce
ggml-cpu : fix HWCAP2_I8MM value (llama/10646)
Diego Devesa
2024-12-04 14:40:44 +0100 -
491ec076b4
vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (llama/10642)
Jeff Bolz
2024-12-04 01:28:59 -0600 -
966433fdf2
SYCL : Move to compile time oneMKL interface backend selection for NVIDIA backend (llama/10584)
Nicolò Scipione
2024-12-04 02:29:20 +0100 -
6f1ba9d82d
Avoid using __fp16 on ARM with old nvcc (llama/10616)
Frankie Robertson
2024-12-04 02:41:37 +0200 -
015ecd0001
vulkan: optimize and reenable split_k (llama/10637)
Jeff Bolz
2024-12-03 13:29:54 -0600 -
b7c64a4352
ggml: add GGML_SET Metal kernel + i32 CPU kernel (ggml/1037)
PAB
2024-12-04 09:19:30 +0100 -
7895d39508
ggml : add GGML_PAD_REFLECT_1D operation (ggml/1034)
PAB
2024-12-03 20:20:04 +0100 -
22616f00f9
files : remove make artifacts
Georgi Gerganov
2024-12-03 20:29:32 +0200 -
02c6fcbc2c
common : fix compile warning
Georgi Gerganov
2024-12-03 20:25:37 +0200 -
3daeacad24
ggml : move AMX to the CPU backend (llama/10570)
Diego Devesa
2024-12-03 20:22:12 +0200 -
4d73962da4
metal : small-batch mat-mul kernels (llama/10581)
Georgi Gerganov
2024-12-03 11:52:33 +0200 -
068812650e
SYCL: Fix and switch to GGML_LOG system instead of fprintf (llama/10579)
Akarshan Biswas
2024-12-02 12:34:11 +0530 -
4b7e059e15
ggml-cpu: replace AArch64 NEON assembly with intrinsics in ggml_gemv_q4_0_4x4_q8_0() (llama/10567)
Adrien Gallouët
2024-11-30 18:13:18 +0100 -
30e35d7271
vulkan: Dynamic subgroup size support for Q6_K mat_vec (llama/10536)
Eve
2024-11-30 07:00:02 +0000 -
3623bd58f2
ggml : fix I8MM Q4_1 scaling factor conversion (llama/10562)
Georgi Gerganov
2024-11-29 16:25:39 +0200 -
cb847c20a7
ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (llama/10580)
Shupei Fan
2024-11-29 21:49:02 +0800 -
964b154a2a
sycl : offload of get_rows set to 0 (llama/10432)
Alberto Cabrera Pérez
2024-11-29 12:38:45 +0000 -
d7c2a04bce
sycl : Reroute permuted mul_mats through oneMKL (llama/10408)
Alberto Cabrera Pérez
2024-11-29 09:49:43 +0000 -
2bb4ca9cba
CANN: RoPE operator optimization (llama/10563)
Chenguang Li
2024-11-29 14:46:55 +0800 -
a753a82462
vulkan: get the first command buffer submitted sooner (llama/10499)
Jeff Bolz
2024-11-29 00:18:02 -0600 -
276b08d8f0
ggml : remove redundant copyright notice + update authors
Georgi Gerganov
2024-11-28 20:46:40 +0200 -
4ca1e72fe0
ggml : fix row condition for i8mm kernels (llama/10561)
Georgi Gerganov
2024-11-28 14:56:37 +0200 -
16a66f103f
cmake : fix ARM feature detection (llama/10543)
Georgi Gerganov
2024-11-28 14:56:23 +0200 -
330273901f
ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)
Shupei Fan
2024-11-28 20:52:03 +0800 -
42099a9342
kompute : improve backend to pass test_backend_ops (llama/10542)
Sergio López
2024-11-28 12:51:38 +0100 -
90dd5fca9c
CANN: Fix SOC_TYPE compile bug (llama/10519)
leo-pony
2024-11-28 15:25:24 +0800 -
2490f2a7f8
CANN: ROPE operator optimization (llama/10540)
Chenguang Li
2024-11-28 14:24:46 +0800 -
230e985633
Add some minimal optimizations for CDNA (llama/10498)
uvos
2024-11-27 17:10:08 +0100 -
ae24083f23
metal : fix group_norm support condition (llama/0)
Georgi Gerganov
2024-11-27 11:22:14 +0200 -
6463e36369
vulkan: define all quant data structures in types.comp (llama/10440)
Jeff Bolz
2024-11-27 01:32:54 -0600 -
b3301f7d82
vulkan: Handle GPUs with less shared memory (llama/10468)
Jeff Bolz
2024-11-27 01:30:27 -0600 -
ab5d4d93ec
vulkan: further optimize q5_k mul_mat_vec (llama/10479)
Jeff Bolz
2024-11-27 01:21:59 -0600 -
2d6e9dd723
vulkan: skip integer div/mod in get_offsets for batch_idx==0 (llama/10506)
Jeff Bolz
2024-11-27 01:08:54 -0600 -
2f16e51553
vulkan: optimize Q2_K and Q3_K mul_mat_vec (llama/10459)
Jeff Bolz
2024-11-27 01:00:50 -0600 -
0f0994902f
mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update cmake and make (llama/10516)
R0CKSTAR
2024-11-27 00:00:41 +0800 -
5e1fcc1780
vulkan: fix group_norm (llama/10496)
Jeff Bolz
2024-11-26 09:45:05 -0600 -
48f421de23
cmake : enable warnings in llama (llama/10474)
Georgi Gerganov
2024-11-26 14:18:08 +0200 -
e7afb2b991
ggml-cpu: cmake add arm64 cpu feature check for macos (llama/10487)
Charles Xu
2024-11-26 12:37:05 +0100 -
9a5ef7b169
CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454)
Shanshan Shen
2024-11-26 18:08:37 +0800 -
453cc0fcf1
CANN: RoPE and CANCAT operator optimization (llama/10488)
Chenguang Li
2024-11-26 17:31:05 +0800 -
78dfec6bc5
vulkan: Fix a vulkan-shaders-gen arugment parsing error (llama/10484)
Junil Kim
2024-11-26 10:47:20 +0900 -
f6d518fc4c
metal : enable mat-vec kernels for bs <= 4 (llama/10491)
Georgi Gerganov
2024-11-25 21:49:31 +0200 -
ac33379a35
llama : accept a list of devices to use to offload a model (llama/10497)
Diego Devesa
2024-11-25 19:30:06 +0100 -
77e3e4a090
ggml : add support for dynamic loading of backends (llama/10469)
Diego Devesa
2024-11-25 15:13:39 +0100 -
b840bb09be
metal : minor code formatting
Georgi Gerganov
2024-11-25 15:08:04 +0200 -
8b1c1c30a7
ggml : do not use ARM features not included in the build (llama/10457)
Diego Devesa
2024-11-23 14:41:12 +0100 -
4b81335f75
CANN: Support Ascend310P to accelerate F32 and F16 Model (llama/10216)
leo-pony
2024-11-22 14:07:20 +0800 -
2a4b5c9d7e
cuda : optimize argmax (llama/10441)
Diego Devesa
2024-11-21 18:18:50 +0100 -
04662748aa
vulkan: predicate max operation in soft_max shaders/soft_max (llama/10437)
Jeff Bolz
2024-11-20 13:47:36 -0600 -
a117279e13
vulkan: copy iq4_nl LUT into shared memory (llama/10409)
Jeff Bolz
2024-11-20 01:40:18 -0600 -
bbb292ed38
vulkan: further optimize mul_mat_vec using larger loads (llama/10387)
Jeff Bolz
2024-11-20 01:11:00 -0600 -
95e8901e71
add cmake rvv support (llama/10411)
haopeng
2024-11-20 04:10:31 +0800 -
4af9626702
CUDA: remove unnecessary warp reduce in FA (ggml/1032)
mahorozte
2024-12-03 21:11:43 +0800 -
c52d1035de
feat: add GGML_UNARY_OP_ARGMAX Metal kernel (ggml/1019)
PAB
2024-12-02 19:27:24 +0100 -
5773a14980
metal : add GGML_OP_CONV_TRANSPOSE_1D kernels (ggml/1026)
PAB
2024-11-28 09:25:06 +0100 -
6939147c47
Do not include arm_neon.h when compiling CUDA code (ggml/1028)
Frankie Robertson
2024-11-26 15:50:26 +0200 -
98f9916c9f
ggml-opt: fix data corruption (ggml/1022)
Johannes Gäßler
2024-11-20 14:56:04 +0100 -
280d2735bc
2024-12-08 15:52:57 +0200 -
668930a989
2024-12-08 15:48:14 +0200 -
762f63e2d0
2024-12-08 15:36:01 +0200 -
a5cd03a921
2024-12-08 13:35:35 +0200 -
e3d545e5a6
Fix vulkan Makefile paths
Lluís Batlle i Rossell
2024-12-07 12:27:48 +0100 -
ae769eae71
Use C++17
Kitaiti Makoto
2024-12-06 23:05:42 +0900 -
729effe4cf
2024-12-06 15:34:53 +0200 -
1a1fcd37cf
2024-12-05 14:30:33 +0200 -
dfe6652b0d
2024-12-05 14:29:18 +0200 -
dfddca02ec
2024-12-04 14:45:40 +0100 -
61aff48839
2024-12-04 14:40:44 +0100 -
b311da34cf
2024-12-04 01:28:59 -0600 -
3085e2883a
2024-12-04 02:29:20 +0100 -
9623ba19b2
2024-12-04 02:41:37 +0200 -
03331b1de8
2024-12-03 13:29:54 -0600 -
e20efac003
GGML_SET Metal kernel + i32 CPU kernel (ggml/1037)
PAB
2024-12-04 09:19:30 +0100 -
40d5987bf3
GGML_PAD_REFLECT_1D operation (ggml/1034)
PAB
2024-12-03 20:20:04 +0100