whisper.cpp / ggml-cuda.cu
JohannesGaessler
CUDA: faster q8_0 -> f16 dequantization (llama/4895)
0a1a178 unverified
423 kB