Spaces:
Sleeping
Sleeping
Commit History
whisper : update FA call
2bfec97
sync : ggml
7ba8c97
sync : vulkan (skip) (llama/0)
5fe3dd6
ggml : do not crash when quantizing q4_x_x with an imatrix (llama/9192)
d64f932
slaren
commited on
metal : separate scale and mask from QKT in FA kernel (llama/9189)
90cc3cd
ggml : add SSM Metal kernels (llama/8546)
b6e7294
metal : gemma2 flash attention support (llama/9159)
e62fd15
slaren
commited on
CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)
fb8ae8b
Add a space to supress a cmake warning (llama/9133)
287612e
Add oneDNN primitive support (llama/9091)
b4d8c3e
fallback mmvq (llama/9088)
4b1fda0
Fix SYCL `im2col` and `convert` Overflow with Large Dims (llama/9052)
5f43886
rpc : print error message when failed to connect endpoint (llama/9042)
d54b156
rpc : prevent crashes on invalid input (llama/9040)
656ae00
ggml : dynamic ggml_sched_max_splits based on graph_size (llama/9047)
e0dc1ad
cmake : remove unused option GGML_CURL (llama/9011)
12634fc
ggml : move rope type enum to ggml.h (llama/8949)
9d45f48
ggml: fix div-by-zero (llama/9003)
d9ee26f
DavidKorczynski
commited on
Optimize Vulkan backend for better CPU performance and less GPU synchronization overhead. (llama/8943)
11bc9e6
feat: ref. cross entropy, add CUDA, fix grad test (ggml/929)
e1e87a3
ggml: remove bad assert (ggml/928)
ba483f7
examples: add MNIST training + missing ops
0828065
models : add support for wget2 for fedora (#2387)
0653499
unverified
Brad Murray
commited on
readme : update the path to bench.py (#2386)
57c7a6b
unverified
Peng
commited on
readme : fix typo (#2383)
16e5a16
unverified
readme : fix broken links in implementation details section (#2382)
4863dee
unverified
stormofice
commited on
whisper : fix compile warning for unused params
0e05e03
unverified
sync : ggml vulkan (ggml/0)
c4c7e49
ggml : fix typo in ggml-quants.c comment (ggml/922)
f158bc0
feat: add new `sin` and `cos` operators (ggml/919)
f541d31
readme : fix broken links (#2358)
93e1056
unverified
examples : use colorblind friendly TTY color scheme (#2360)
09303a2
unverified
Justine Tunney
commited on
sync : ggml
e6d1739
unverified
ggml : support forward pass broadcasting in ggml_sub (ggml/914)
0af2d37
unverified
metal : fix uninitialized abort_callback (llama/8968)
f971b60
unverified
slaren
commited on
rpc : sanitize tensor data + warnings (llama/0)
87d58fe
unverified
cann : add Ascend NPU support (#2336)
94baae9
unverified
whisper : fix compile warning (#0)
1a699ea
sync : ggml
acf76b7
ggml : add CANN backend (llama/0)
7c34a03
scripts : sync cann
0a74031
ci : disable ruby workflow (#0)
4b0eff8
ci : try to fix FreeBSD (#0)
683de5a
build : fix aarch64 (#0)
55befbb
talk-llama : sync llama.cpp
a40d0a7
sync : ggml
96e8b15
ggml-backend : fix async copy from CPU (llama/8897)
050174c
slaren
commited on