IlyasMoutawwakil/awq-exllama
awq-exllama: 415 kB, 1 contributor, 3 commits
Latest commit: 999b799 (verified), "Upload folder using huggingface_hub", by IlyasMoutawwakil (HF Staff), almost 2 years ago
Name                                            Size     Last commit                          Updated
awq-exllama-bs-1                                -        Upload folder using huggingface_hub  almost 2 years ago
awq-exllama-bs-16                               -        Upload folder using huggingface_hub  almost 2 years ago
awq-exllama-bs-2                                -        Upload folder using huggingface_hub  almost 2 years ago
awq-exllama-bs-32                               -        Upload folder using huggingface_hub  almost 2 years ago
awq-exllama-bs-4                                -        Upload folder using huggingface_hub  almost 2 years ago
awq-exllama-bs-64                               -        Upload folder using huggingface_hub  almost 2 years ago
awq-exllama-bs-8                                -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-1                                   -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-16                                  -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-2                                   -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-32                                  -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-4                                   -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-64                                  -        Upload folder using huggingface_hub  almost 2 years ago
awq-gemm-bs-8                                   -        Upload folder using huggingface_hub  almost 2 years ago
.gitattributes                                  1.52 kB  initial commit                       almost 2 years ago
bench_awq.py                                    1.88 kB  Upload folder using huggingface_hub  almost 2 years ago
plot_awq.py                                     1.65 kB  Upload folder using huggingface_hub  almost 2 years ago
pytorch_awq_exllama.csv                         119 kB   Upload folder using huggingface_hub  almost 2 years ago
pytorch_awq_exllama_per_token.latency.mean.png  19.8 kB  Upload folder using huggingface_hub  almost 2 years ago
pytorch_awq_exllama_prefill.latency.mean.png    18.2 kB  Upload folder using huggingface_hub  almost 2 years ago