Default Branch

7ffcd05267 · ruby : Make context accept initial parameters, API to retrieve a segment and more (#2749) · Updated 2025-01-21 08:39:54 +01:00

Branches

e2aa556a99 · whisper : experiments with Flash Attention in the decoder · Updated 2023-01-07 20:00:51 +01:00 · 1665 behind, 1 ahead

4e6d2e98ab · ggml : try to improve threading · Updated 2022-12-29 12:05:20 +01:00 · 1705 behind, 1 ahead

683f111088 · ggml : initial tests with libnvblas · Updated 2022-12-08 21:01:52 +01:00 · 1784 behind, 1 ahead

e0bd97f41f · ggml : use macros to inline FP16 <-> FP32 conversions · Updated 2022-12-06 21:05:33 +01:00 · 1788 behind, 1 ahead

0a2621b637 · stream : add "max_tokens" cli arg · Updated 2022-11-20 20:22:02 +01:00 · 1866 behind, 5 ahead

fa9621e5e9 · mtl : update Makefile to support Metal · Updated 2022-11-12 07:32:03 +01:00 · 1876 behind, 2 ahead

5d895d60b6 · Merge branch 'master' into avx512 · Updated 2022-11-06 08:09:50 +01:00 · 1881 behind, 7 ahead

210a6fb83c · wip : some unsuccessful experiments using DP · Updated 2022-11-01 20:28:30 +01:00 · 1904 behind, 1 ahead

4597c9c19b · wip : try to compress just mlp · Updated 2022-10-08 14:12:15 +02:00 · 2018 behind, 2 ahead