whisper.cpp / ggml-cuda.h

Commit History

sync : ggml (#2001)
cbbfa9e
unverified

ggerganov committed on

ggml : introduce GGML_CALL function annotation (llama/4850)
7815f68
unverified

jartine committed on

sync : ggml (new ops, new backend, etc) (#1602)
895e87a
unverified

ggerganov committed on

whisper : add batched decoding (#1486)
0131aa6
unverified

ggerganov committed on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov, Chris Raethke committed on

ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220)
d41ba35
unverified

ggerganov committed on

ggml : sync latest repo (mostly refactoring changes)
d97fd69
unverified

ggerganov committed on

ggml : sync latest ggml lib
a100c9a
unverified

ggerganov committed on

ggml : sync latest ggml repo
6ee8740
unverified

ggerganov committed on

ggml : sync latest ggml
803e1be
unverified

ggerganov committed on

ggml : sync ggml (clBLAST + tensor names)
f50d3b3
unverified

ggerganov committed on

ggml : sync latest ggml + llama.cpp updates (quantization)
ede1268
unverified

ggerganov committed on