alloc : fix allocation data of pre-allocated leafs 0c378f2 unverified slaren commited on Mar 16, 2024
llama : add pipeline parallelism support (llama/6017) b5bb3f3 unverified slaren compilade ggerganov commited on Mar 13, 2024
ggml : add ALiBi support for ggml_soft_max_ext (llama/5488) 26c019a unverified ggerganov commited on Feb 19, 2024
ggml-alloc : allocate all leafs as if they were inputs (ggml/731) a512417 unverified slaren commited on Feb 12, 2024
ggml alloc: Fix for null dereference on alloc failure (llama/5200) 8181686 unverified Paul Tsochantaris commited on Jan 29, 2024
ggml : add Vulkan backend (llama/2059) 5a97aba unverified OccamRazor SlyEcho Concedo slaren ggerganov commited on Jan 28, 2024
ggml-alloc : add 10% margin to the buffer sizes (llama/5149) c55bdf8 unverified slaren commited on Jan 26, 2024
llama : pre-allocate input tensors in a separate buffer (llama/5100) 20a4ca1 unverified slaren commited on Jan 24, 2024
llama : ggml-backend integration (llama/4766) 362430b unverified slaren ggerganov JohannesGaessler commited on Jan 12, 2024
sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677) aa86ade unverified ggerganov commited on Dec 22, 2023
sync : ggml (ggml-alloc + linker + gguf fixes) (#1501) 58507b9 unverified ggerganov commited on Nov 17, 2023
sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422) 7006035 unverified ggerganov Chris Raethke commited on Nov 3, 2023
ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220) d41ba35 unverified ggerganov commited on Sep 5, 2023