Spaces:
Sleeping
Sleeping
Commit History
ggml : fix compile warnings (llama/0)
80d6ec0
llamafile : fix include path (llama/0)
e443f89
vulkan: Optimize some mat-vec mul quant shaders (llama/10296)
dc0e685
ggml : optimize Q4_0 into Q4_0_X_Y repack (llama/10324)
abf6f22
Dan Johansson
commited on
Make updates to fix issues with clang-cl builds while using AVX512 flags (llama/10314)
2868c2b
Srihari-mcw
commited on
ggml: new optimization interface (ggml/988)
dd33ace
ggml : remove duplicated sources from the last sync (ggml/1017)
026d20b
ggml : fix some build issues
c5ba1d1
slaren
commited on
sync : leftovers (ggml/0)
0f6c498
cmake : restore CMakeLists.txt (llama/10256)
51a70ff
AVX BF16 and single scale quant optimizations (llama/10212)
e6ffed3
Eve
commited on
sycl: Use syclcompat::dp4a (llama/10267)
ce0dc30
Romain Biessy
commited on
backend cpu: add online flow for aarch64 Q4_0 GEMV/GEMM kernels (llama/9921)
3541ee8
Charles Xu
Diego Devesa
commited on
ggml : build backends as libraries (llama/10256)
3dc93f3
scripts : update sync
1741306
release : v1.7.2
414329d
unverified
sycl: fix example build (#2570)
a0dcffc
unverified
Stefan Sydow
commited on
ci : use local ggml in Android build (#2567)
72b7501
unverified
ggml : tmp workaround for whisper.cpp (skip) (#2565)
ef26f48
unverified
update : readme
d1fa03c
unverified
scripts : fix sync path
9a2f912
unverified
whisper.swiftui : switch Mac dest to Mac (Designed for iPad) (#2562)
13f2beb
unverified
cmake : fix ppc64 check (#0)
f3c3fca
whisper : include ggml-cpu.h (#0)
cb35171
build : fixes
11d19cb
talk-llama : sync llama.cpp
6bb34fb
whisper : fix build (#0)
dfd316d
sync : ggml
9e83be6
sycl : Fixes to broken builds and test-backend-ops (llama/10257)
9cfb13b
Alberto Cabrera Pérez
commited on
vulkan: Optimize contiguous copies (llama/10254)
9974bd6
vulkan: Throttle the number of shader compiles during the build step. (llama/10222)
9677a2f
metal : more precise Q*K in FA vec kernel (llama/10247)
9160e8f
vulkan: Fix newly added tests for permuted mul_mat and 1D im2col (llama/10226)
76b8073
metal : reorder write loop in mul mat kernel + style (llama/10231)
661360d
metal : fix build and some more comments (llama/10229)
93fc215
metal : fix F32 accumulation in FA vec kernel (llama/10232)
228e0b2
metal : hide debug messages from normal log
efefcbb
ggml: fix zero division in ‘dne’ calculation in CUDA COUNT_EQUAL operator when ‘ne’ is small (#10213)
0ecc4d6
ggml : optimize llamafile cpu matrix multiplication for ppc64le (llama/10156)
18bdb35
amritahs-ibm
commited on
metal : opt-in compile flag for BF16 (llama/10218)
5f667d1
metal : improve clarity (minor) (llama/10171)
d68ae7c
metal : optimize FA kernels (llama/10171)
44ff932
ggml : add ggml-cpu.h to the public headers (llama/10204)
936a35f
Diego Devesa
commited on
fix q4_0_8_8 format for corrupted tokens issue (llama/10198)
4700b48
snadampal
EC2 Default User
commited on
metal : add BF16 support (llama/8439)
847669b
metal : fix from ptr buffer name (llama/10189)
c4d59b9
Diego Devesa
commited on