Commit History

talk-llama : sync llama.cpp
06c222c
unverified

ggerganov commited on

talk-llama : sync llama.cpp
b92d757
unverified

ggerganov commited on

talk-llama : sync llama.cpp
53d0282
unverified

ggerganov commited on

talk-llama : sync llama.cpp
542accf
unverified

ggerganov commited on

talk-llama : sync llama.cpp
aa42df9
unverified

ggerganov commited on

talk-llama : sync llama.cpp
e6d6e1d
unverified

ggerganov commited on

talk-llama : sync llama.cpp
1453539
unverified

ggerganov commited on

talk-llama : sync llama.cpp
92cfd93
unverified

ggerganov commited on

sync : llama.cpp
5de718a
unverified

ggerganov commited on

talk-llama : llama.cpp
d128cb3
unverified

ggerganov commited on

talk-llama : sync llama.cpp
b9d2bd9
unverified

ggerganov commited on

talk-llama : sync llama.cpp
75c5f9c
unverified

ggerganov commited on

talk-llama : sync llama.cpp
f33490f
unverified

ggerganov commited on

talk-llama : sync latest llama.cpp
42123fc
unverified

ggerganov commited on

sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677)
aa86ade
unverified

ggerganov commited on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov Chris Raethke commited on

talk-llama : update to latest llama.cpp
1493d0c
unverified

ggerganov commited on

talk-llama : sync latest llama.cpp (close #922, close #954)
ad4065a
unverified

ggerganov commited on

talk-llama : fix build + sync latest llama.cpp
ef85c02
unverified

ggerganov commited on

talk-llama : only copy used KV cache in get / set state (#890)
773c85f
unverified

Luis Herrera evanqjones commited on

talk-llama : add --session support (#845)
a7b3aa5
unverified

Luis Herrera commited on

whisper : add integer quantization support (#540)
a5f8f3c
unverified

ggerganov commited on

talk-llama : update to latest llama.cpp (improved performance)
ab6eb47
unverified

ggerganov commited on

talk-llama : add new example + sync ggml from llama.cpp (#664)
a8c74e6
unverified

ggerganov commited on