Commits · natasa365/whisper.cpp

whisper : remove whisper_load_backends function (#3196)

0cae2d6
unverified

danbev commited on May 29, 2025

ruby : add VAD support, migration to Ruby's newer API (#3197)

1ee7297
unverified

KitaitiMakoto commited on May 28, 2025

whisper : install shared libs when using GGML_BACKEND_DL (#3195)

f44915b
unverified

peardox commited on May 28, 2025

tests : add a new benchmark test for long-form audio (#3185)

2d5018c
unverified

fujimotos commited on May 28, 2025

ci : update windows-blas uploads action (#3192)

3b0cb7a
unverified

danbev commited on May 27, 2025

sync : fix builds - musa, ruby

84007bb

ggerganov commited on May 27, 2025

talk-llama : sync llama.cpp

5d037b9

ggerganov commited on May 27, 2025

sync : ggml

78bfd81

ggerganov commited on May 27, 2025

ggml : riscv: add xtheadvector support (llama/13720)

4790d5d

xctan commited on May 27, 2025

ggml-cpu: x86 feature detection is specific to x86 (llama/13811)

d86ba47

Christian Kastner commited on May 27, 2025

ggml : allow CUDA graphs when using pipeline parallelism (llama/13814)

b85e3c0

Diego Devesa commited on May 27, 2025

cuda : avoid cuGetErrorString (llama/13791)

cdf95d3

ggerganov commited on May 26, 2025

SYCL: Add non contiguous support in RMS_NORM and NORM kernels (llama/13611)

5de15cd

Akarshan Biswas commited on May 26, 2025

sycl: Add more debug prints (llama/13640)

4da3fb6

Romain Biessy commited on May 26, 2025

vulkan: mark IM2COL as supporting non-contig (llama/13783)

09c03ad

jeffbolznv commited on May 26, 2025

CANN: Add the basic supports of Flash Attention kernel (llama/13627)

112c144

Bizhao Shi commited on May 26, 2025

SYCL: revert "sycl: simplify bin_bcast_kernel (ggml/13383)" (llama/13752)

8c2a700

Akarshan Biswas commited on May 25, 2025

ggml-cpu : set openmp wait time if not set (llama/13758)

276d920

Diego Devesa commited on May 24, 2025

ggml : add ggml_gelu_erf() CUDA kernel (llama/13719)

b154325

ngxson HF Staff commited on May 24, 2025

CUDA: fix race condition in FA vector kernels (llama/13742)

38a702a

JohannesGaessler commited on May 24, 2025

CANN: Support MUL_MAT_ID for q8_0 and q4_0 (llama/13705)

6a9f9dc

Chenguang Li commited on May 23, 2025

ggml : fix the order of ggml_unary_op (llama/13718)

bdae2b3

ngxson HF Staff commited on May 23, 2025

vulkan: support CPY from any type to itself (llama/13695)

f5f766b

jeffbolznv commited on May 23, 2025

vulkan: Disable coopmat/coopmat2/bfloat extensions if glslc doesn't support it (llama/13696)

69679f5

jeffbolznv commited on May 23, 2025

use LOG_WARN to replace `std::cerr` (llama/13657)

6975ec2

Judd commited on May 23, 2025

sycl : Remove waits from function calls (llama/13702)

b9bf6b6

Nicolò Scipione commited on May 22, 2025

SYCL: Avoid using with SYCL-Graph for unsupported nodes (llama/13587)

7eb0e6e

Ewan Crawford commited on May 22, 2025

opencl: Add support for multiple devices (llama/12622)

b6cddb5

Henry Linjamäki commited on May 21, 2025

opencl: fix couple crashes (llama/12795)

2eea73d

Henry Linjamäki commited on May 21, 2025

ggml : add ggml_gelu_erf() (llama/13667)

6c9cd9a

ngxson HF Staff commited on May 21, 2025

musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (llama/13647)

9506ebb

yeahdongcn

JohannesGaessler commited on May 21, 2025

vulkan: fix warnings (llama/13626)

8602d10

Eve commited on May 20, 2025

CUDA: skip fully masked-out KV in FA vec kernel (llama/13584)

e1f825c

JohannesGaessler commited on May 20, 2025

sycl: disable reorder for sycl mulmat (llama/13536)

e023dc2

Svetlozar Georgiev commited on May 20, 2025

metal : fix typo in FA kernel comments (llama/13651)

4c32ada

ggerganov commited on May 20, 2025

sycl : Overcoming workaround for mmap() allocation on Windows (llama/13482)

bf74ede

Nicolò Scipione commited on May 20, 2025

Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 32B incoherence (llama/13607)

dfa38af

OccamRazor commited on May 19, 2025

sync : ggml

3b09d20

ggerganov commited on May 27, 2025

docs : convert README_sycl.md to utf8 format [no ci] (#3191)

2384106
unverified

danbev commited on May 27, 2025

node : enable no_prints to suppress all output (#3189)

1b2bc05
unverified

danbev commited on May 27, 2025

talk-llama : fix for swedish umlauts + expose model inference settings in talk-llama.cpp (#3187)

1473e33
unverified

matteng1

ggerganov commited on May 26, 2025

docs : fix VAD section heading levels (#3186)

a7bcfbf
unverified

KitaitiMakoto commited on May 23, 2025

ci : use dynamic libopenblas.dll for window-blas (#3177)

bafccd1
unverified

danbev commited on May 23, 2025

server : Add k6 Load Testing Script (#3175)

9a681c7
unverified

sachaarbonel commited on May 22, 2025

docs : add VAD model download instructions [no ci] (#3180)

e789f73
unverified

danbev commited on May 22, 2025

docs : replace typo "]"with ")" in README (#3179)

5e8b0f0
unverified

Alpaim commited on May 22, 2025

whisper : remove redundant assignments (#3178)

ec40497
unverified

danbev commited on May 21, 2025

whisper : update CMakeLists.txt to handle deprecated gpu Warnings (#3163)

2ee9c36
unverified

Jugal Haresh Sheth Jugal Sheth commited on May 20, 2025

ruby : add GGML_SYCL_DNN option to ruby bindings (#3172)

94d5ce3
unverified

danbev commited on May 19, 2025

talk-llama : sync llama.cpp

44ee199

ggerganov commited on May 19, 2025

Commit History

whisper : remove whisper_load_backends function (#3196) 0cae2d6 unverified

ruby : add VAD support, migration to Ruby's newer API (#3197) 1ee7297 unverified

whisper : install shared libs when using GGML_BACKEND_DL (#3195) f44915b unverified

tests : add a new benchmark test for long-form audio (#3185) 2d5018c unverified

ci : update windows-blas uploads action (#3192) 3b0cb7a unverified

sync : fix builds - musa, ruby 84007bb

talk-llama : sync llama.cpp 5d037b9

sync : ggml 78bfd81

ggml : riscv: add xtheadvector support (llama/13720) 4790d5d

ggml-cpu: x86 feature detection is specific to x86 (llama/13811) d86ba47

ggml : allow CUDA graphs when using pipeline parallelism (llama/13814) b85e3c0

cuda : avoid cuGetErrorString (llama/13791) cdf95d3

SYCL: Add non contiguous support in RMS_NORM and NORM kernels (llama/13611) 5de15cd

sycl: Add more debug prints (llama/13640) 4da3fb6

vulkan: mark IM2COL as supporting non-contig (llama/13783) 09c03ad

CANN: Add the basic supports of Flash Attention kernel (llama/13627) 112c144

SYCL: revert "sycl: simplify bin_bcast_kernel (ggml/13383)" (llama/13752) 8c2a700

ggml-cpu : set openmp wait time if not set (llama/13758) 276d920

ggml : add ggml_gelu_erf() CUDA kernel (llama/13719) b154325

CUDA: fix race condition in FA vector kernels (llama/13742) 38a702a

CANN: Support MUL_MAT_ID for q8_0 and q4_0 (llama/13705) 6a9f9dc

ggml : fix the order of ggml_unary_op (llama/13718) bdae2b3

vulkan: support CPY from any type to itself (llama/13695) f5f766b

vulkan: Disable coopmat/coopmat2/bfloat extensions if glslc doesn't support it (llama/13696) 69679f5

use LOG_WARN to replace `std::cerr` (llama/13657) 6975ec2

sycl : Remove waits from function calls (llama/13702) b9bf6b6

SYCL: Avoid using with SYCL-Graph for unsupported nodes (llama/13587) 7eb0e6e

opencl: Add support for multiple devices (llama/12622) b6cddb5

opencl: fix couple crashes (llama/12795) 2eea73d

ggml : add ggml_gelu_erf() (llama/13667) 6c9cd9a

musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (llama/13647) 9506ebb

vulkan: fix warnings (llama/13626) 8602d10

CUDA: skip fully masked-out KV in FA vec kernel (llama/13584) e1f825c

sycl: disable reorder for sycl mulmat (llama/13536) e023dc2

metal : fix typo in FA kernel comments (llama/13651) 4c32ada

sycl : Overcoming workaround for mmap() allocation on Windows (llama/13482) bf74ede

Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 32B incoherence (llama/13607) dfa38af

sync : ggml 3b09d20

docs : convert README_sycl.md to utf8 format [no ci] (#3191) 2384106 unverified

node : enable no_prints to suppress all output (#3189) 1b2bc05 unverified

talk-llama : fix for swedish umlauts + expose model inference settings in talk-llama.cpp (#3187) 1473e33 unverified

docs : fix VAD section heading levels (#3186) a7bcfbf unverified

ci : use dynamic libopenblas.dll for window-blas (#3177) bafccd1 unverified

server : Add k6 Load Testing Script (#3175) 9a681c7 unverified

docs : add VAD model download instructions [no ci] (#3180) e789f73 unverified

docs : replace typo "]"with ")" in README (#3179) 5e8b0f0 unverified

whisper : remove redundant assignments (#3178) ec40497 unverified

whisper : update CMakeLists.txt to handle deprecated gpu Warnings (#3163) 2ee9c36 unverified

ruby : add GGML_SYCL_DNN option to ruby bindings (#3172) 94d5ce3 unverified

talk-llama : sync llama.cpp 44ee199

whisper : remove whisper_load_backends function (#3196)

0cae2d6
unverified

ruby : add VAD support, migration to Ruby's newer API (#3197)

1ee7297
unverified

whisper : install shared libs when using GGML_BACKEND_DL (#3195)

f44915b
unverified

tests : add a new benchmark test for long-form audio (#3185)

2d5018c
unverified

ci : update windows-blas uploads action (#3192)

3b0cb7a
unverified

sync : fix builds - musa, ruby

84007bb

talk-llama : sync llama.cpp

5d037b9

sync : ggml

78bfd81

ggml : riscv: add xtheadvector support (llama/13720)

4790d5d

ggml-cpu: x86 feature detection is specific to x86 (llama/13811)

d86ba47

ggml : allow CUDA graphs when using pipeline parallelism (llama/13814)

b85e3c0

cuda : avoid cuGetErrorString (llama/13791)

cdf95d3

SYCL: Add non contiguous support in RMS_NORM and NORM kernels (llama/13611)

5de15cd

sycl: Add more debug prints (llama/13640)

4da3fb6

vulkan: mark IM2COL as supporting non-contig (llama/13783)

09c03ad

CANN: Add the basic supports of Flash Attention kernel (llama/13627)

112c144

SYCL: revert "sycl: simplify bin_bcast_kernel (ggml/13383)" (llama/13752)

8c2a700

ggml-cpu : set openmp wait time if not set (llama/13758)

276d920

ggml : add ggml_gelu_erf() CUDA kernel (llama/13719)

b154325

CUDA: fix race condition in FA vector kernels (llama/13742)

38a702a

CANN: Support MUL_MAT_ID for q8_0 and q4_0 (llama/13705)

6a9f9dc

ggml : fix the order of ggml_unary_op (llama/13718)

bdae2b3

vulkan: support CPY from any type to itself (llama/13695)

f5f766b

vulkan: Disable coopmat/coopmat2/bfloat extensions if glslc doesn't support it (llama/13696)

69679f5

use LOG_WARN to replace `std::cerr` (llama/13657)

6975ec2

sycl : Remove waits from function calls (llama/13702)

b9bf6b6

SYCL: Avoid using with SYCL-Graph for unsupported nodes (llama/13587)

7eb0e6e

opencl: Add support for multiple devices (llama/12622)

b6cddb5

opencl: fix couple crashes (llama/12795)

2eea73d

ggml : add ggml_gelu_erf() (llama/13667)

6c9cd9a

musa: Upgrade MUSA SDK version to rc4.0.1 and use mudnn::Unary::IDENTITY op to accelerate D2D memory copy (llama/13647)

9506ebb

vulkan: fix warnings (llama/13626)

8602d10

CUDA: skip fully masked-out KV in FA vec kernel (llama/13584)

e1f825c

sycl: disable reorder for sycl mulmat (llama/13536)

e023dc2

metal : fix typo in FA kernel comments (llama/13651)

4c32ada

sycl : Overcoming workaround for mmap() allocation on Windows (llama/13482)

bf74ede

Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 32B incoherence (llama/13607)

dfa38af

sync : ggml

3b09d20

docs : convert README_sycl.md to utf8 format [no ci] (#3191)

2384106
unverified

node : enable no_prints to suppress all output (#3189)

1b2bc05
unverified

talk-llama : fix for swedish umlauts + expose model inference settings in talk-llama.cpp (#3187)

1473e33
unverified

docs : fix VAD section heading levels (#3186)

a7bcfbf
unverified

ci : use dynamic libopenblas.dll for window-blas (#3177)

bafccd1
unverified

server : Add k6 Load Testing Script (#3175)

9a681c7
unverified

docs : add VAD model download instructions [no ci] (#3180)

e789f73
unverified

docs : replace typo "]"with ")" in README (#3179)

5e8b0f0
unverified

whisper : remove redundant assignments (#3178)

ec40497
unverified

whisper : update CMakeLists.txt to handle deprecated gpu Warnings (#3163)

2ee9c36
unverified

ruby : add GGML_SYCL_DNN option to ruby bindings (#3172)

94d5ce3
unverified

talk-llama : sync llama.cpp

44ee199