Spaces:
Sleeping
Sleeping
Commit History
ggml-ci: add run.sh (#2877)
3e264ce
unverified
examples : add dl to the list of libraries linked (#2875)
dc2a512
unverified
whisper: add xcframework build script (#2873)
9797241
unverified
objc : fix build, tmp remove GPU support, use C++17
e1b5e01
cmake : fix ggml-config (ggml/0)
40f0325
sync : ggml
f18e587
ggml-cpu: faster AVX2 variant for IQ1_M (llama/12216)
591cbfb
Rémy O
commited on
metal : simplify kernel arguments using a struct (ggml/3229) (llama/12194)
092277a
BB-fat
alexju
commited on
metal : fix default.metallib build (llama/12224)
838efb6
opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217)
94449e3
lhez
commited on
cmake : fix undefined reference errors for std::filesystem in ggml (#12092) (llama/12094)
dc68418
xiaofei
Ray Lee
commited on
CUDA: fix FA logic for PTX 7.0 and CC >= 7.5 (llama/12222)
4dc8a81
HIP/CUDA: set the paramerter value in maintain_cuda_graph instead of replaceing it. (llama/12209)
18afa4b
uvos
commited on
opencl : fix buffer alignment (llama/12197)
7d25156
opencl : fix `ulong` kernel args were set from `int` variables (llama/12174)
67ffff0
opencl : fix profile-related errors (llama/12095)
e11a847
simon886212
ubuntu
commited on
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154)
05466a9
Rémy O
commited on
SYCL: Disable f16 Unary OPs as not supported by the kernels (llama/12201)
723b8b4
Akarshan Biswas
commited on
ggml : fix GGMLMetalClass ODR (llama/12200)
2094cb7
ggml : ggml_compute_forward_concat() for arbitrary tensor type (ggml/1118)
c9a49f9
vmobilis
commited on
vulkan : sync (llama/0)
4c17fa1
ggml : portability fixes for VS 2017 (llama/12150)
49e3343
mgroeber9110
Marcus Groeber
commited on
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)
a027c1d
David Huang
commited on
ggml : fix kleidiai build (llama/12159)
dbc0180
ag2s20150909
commited on
SYCL: Move CPY kernels to a separate file and add few missing kernels (llama/12133)
1d6d451
Akarshan Biswas
commited on
ggml-backend : keep paths in native string type when possible (llama/12144)
6e89d8c
Diego Devesa
commited on
CUDA: compress mode option and default to size (llama/12029)
4ec988a
Erik Scholz
commited on
ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852
William Tambellini
slaren
commited on
vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations (llama/11595)
d7d82b9
Rémy O
commited on
CUDA: fix logic for V100 + GGML_CUDA_FORCE_MMQ (llama/12098)
0b52fcc
ggml: aarch64: implement SVE kernels for q2_k_q8_k vector dot (llama/12064)
459beb1
Prashant Vithule
vithulep
commited on
CANN: Fix build error with GCC 13 (llama/11990)
dcf68db
vulkan: matmul dequantization improvements (llama/12015)
ffdf466
Eve
commited on
vulkan: improve im2col (llama/11826)
f6cff0a
Daniele
commited on
cmake: Fix ggml backend dependencies and installation (llama/11818)
c6c2a2c
Vladimir Vuksanovic
commited on
vulkan: fix assertion when qy_needs_dequant (llama/12068)
271c7e4
ggml-cpu: Fix build with sve (llama/12059)
4be146e
cuda: unary ops as float + de-duplicate (ggml/1130)
4bec2e4
cmdr2
commited on
cuda/vulkan: specify fp32-only support for some operations in supports_op (ggml/1129)
f959b90
cmdr2
commited on
cuda/cpu: Increase support for fp16 unary operations (ggml/1125)
67e8c32
cmdr2
commited on
Told cmake to install ggml-cpp.h as a public header file. (ggml/1126)
3d4f29c
petterreinholdtsen
Petter Reinholdtsen
commited on
common : more general m_audio_len update logic (#2855)
4674264
unverified
Ivy233
Ivy233
commited on
go : improve model download (#2756)
168712d
unverified
Ryan Johnson
commited on
common : fix audio loading by miniaudio (#2862)
494fb84
unverified
Dmitry Atamanov
commited on
fix: missing include common-whisper (#2858)
2271d56
unverified
Lin Xiaodong
commited on