Support multiple GPUs (split mode) on SYCL backend (llama/5806) b1865d2 Neo Zhang Jianyu committed on Mar 2, 2024
Use batched mul_mat pathway (llama/5591) 4a30367 AidanBeltonS Abhilash Majumder committed on Mar 1, 2024
Add support for soft_max ALiBi (llama/5639) 86d6a5e AidanBeltonS Abhilash Majumder committed on Feb 26, 2024
code : normalize enum names (llama/5697) 93e0830 ggerganov HF Staff committed on Feb 25, 2024
Update ggml_sycl_op_mul_mat_vec_q (llama/5502) 963ffd5 AidanBeltonS Abhilash Majumder committed on Feb 20, 2024
ggml-sycl: Replace 3d ops with macro (llama/5458) 12970f1 Abhilash Majumder committed on Feb 12, 2024