Spaces:
Sleeping
Sleeping
Commit History
ruby : add VAD support, migration to Ruby's newer API (#3197)
1ee7297
unverified
whisper : install shared libs when using GGML_BACKEND_DL (#3195)
f44915b
unverified
tests : add a new benchmark test for long-form audio (#3185)
2d5018c
unverified
ci : update windows-blas uploads action (#3192)
3b0cb7a
unverified
sync : fix builds - musa, ruby
84007bb
talk-llama : sync llama.cpp
5d037b9
sync : ggml
78bfd81
ggml : riscv: add xtheadvector support (llama/13720)
4790d5d
xctan
commited on
ggml-cpu: x86 feature detection is specific to x86 (llama/13811)
d86ba47
Christian Kastner
commited on
ggml : allow CUDA graphs when using pipeline parallelism (llama/13814)
b85e3c0
Diego Devesa
commited on
cuda : avoid cuGetErrorString (llama/13791)
cdf95d3
SYCL: Add non contiguous support in RMS_NORM and NORM kernels (llama/13611)
5de15cd
Akarshan Biswas
commited on
sycl: Add more debug prints (llama/13640)
4da3fb6
Romain Biessy
commited on
vulkan: mark IM2COL as supporting non-contig (llama/13783)
09c03ad
CANN: Add the basic supports of Flash Attention kernel (llama/13627)
112c144
Bizhao Shi
commited on
SYCL: revert "sycl: simplify bin_bcast_kernel (ggml/13383)" (llama/13752)
8c2a700
Akarshan Biswas
commited on
ggml-cpu : set openmp wait time if not set (llama/13758)
276d920
Diego Devesa
commited on
ggml : add ggml_gelu_erf() CUDA kernel (llama/13719)
b154325
CUDA: fix race condition in FA vector kernels (llama/13742)
38a702a
CANN: Support MUL_MAT_ID for q8_0 and q4_0 (llama/13705)
6a9f9dc
Chenguang Li
commited on
ggml : fix the order of ggml_unary_op (llama/13718)
bdae2b3
vulkan: support CPY from any type to itself (llama/13695)
f5f766b
vulkan: Disable coopmat/coopmat2/bfloat extensions if glslc doesn't support it (llama/13696)
69679f5
use LOG_WARN to replace `std::cerr` (llama/13657)
6975ec2
Judd
commited on
sycl : Remove waits from function calls (llama/13702)
b9bf6b6
Nicolò Scipione
commited on
SYCL: Avoid using with SYCL-Graph for unsupported nodes (llama/13587)
7eb0e6e
Ewan Crawford
commited on
opencl: Add support for multiple devices (llama/12622)
b6cddb5
Henry Linjamäki
commited on
opencl: fix couple crashes (llama/12795)
2eea73d
Henry Linjamäki
commited on
ggml : add ggml_gelu_erf() (llama/13667)
6c9cd9a
musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (llama/13647)
9506ebb
vulkan: fix warnings (llama/13626)
8602d10
Eve
commited on
CUDA: skip fully masked-out KV in FA vec kernel (llama/13584)
e1f825c
sycl: disable reorder for sycl mulmat (llama/13536)
e023dc2
Svetlozar Georgiev
commited on
metal : fix typo in FA kernel comments (llama/13651)
4c32ada
sycl : Overcoming workaround for mmap() allocation on Windows (llama/13482)
bf74ede
Nicolò Scipione
commited on
Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 32B incoherence (llama/13607)
dfa38af
sync : ggml
3b09d20
docs : convert README_sycl.md to utf8 format [no ci] (#3191)
2384106
unverified
node : enable no_prints to suppress all output (#3189)
1b2bc05
unverified
talk-llama : fix for swedish umlauts + expose model inference settings in talk-llama.cpp (#3187)
1473e33
unverified
docs : fix VAD section heading levels (#3186)
a7bcfbf
unverified
ci : use dynamic libopenblas.dll for window-blas (#3177)
bafccd1
unverified
server : Add k6 Load Testing Script (#3175)
9a681c7
unverified
docs : add VAD model download instructions [no ci] (#3180)
e789f73
unverified
docs : replace typo "]"with ")" in README (#3179)
5e8b0f0
unverified
Alpaim
commited on
whisper : remove redundant assignments (#3178)
ec40497
unverified
whisper : update CMakeLists.txt to handle deprecated gpu Warnings (#3163)
2ee9c36
unverified
Jugal Haresh Sheth
Jugal Sheth
commited on