Active filters: gptq
ChenMnZ/Llama-2-13b-EfficientQAT-w2g128-BitBLAS • Text Generation • 51B • Updated • 6
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-BitBLAS • Text Generation • 51B • Updated • 9
ChenMnZ/Llama-2-13b-EfficientQAT-w2g64-GPTQ • Text Generation • 13B • Updated • 4
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-BitBLAS • Text Generation • 51B • Updated • 5
Xu-Ouyang/pythia-2.8b-deduped-int4-step129000-GPTQ-wikitext2 • Text Generation • 3B • Updated • 3
ChenMnZ/Llama-2-13b-EfficientQAT-w4g128-GPTQ • Text Generation • 13B • Updated • 12
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-BitBLAS • Text Generation • 274B • Updated • 2
ChenMnZ/Llama-2-70b-EfficientQAT-w2g128-GPTQ • Text Generation • 69B • Updated • 13
ChenMnZ/Llama-2-70b-EfficientQAT-w2g64-GPTQ • Text Generation • 69B • Updated • 8
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-BitBLAS • Text Generation • 275B • Updated • 5
ChenMnZ/Llama-2-70b-EfficientQAT-w4g128-GPTQ • Text Generation • 69B • Updated • 21
Xu-Ouyang/pythia-2.8b-deduped-int3-step14000-GPTQ-wikitext2 • Text Generation • 3B • Updated • 4
Xu-Ouyang/pythia-12b-deduped-int3-step14000-GPTQ-wikitext2 • Text Generation • 11B • Updated • 5
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-GPTQ • Text Generation • 7B • Updated • 14
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-GPTQ • Text Generation • 7B • Updated • 8 • 1
Xu-Ouyang/pythia-2.8b-deduped-int3-step29000-GPTQ-wikitext2 • Text Generation • 3B • Updated • 4
ModelCloud/gemma-2-27b-it-gptq-4bit • Text Generation • 28B • Updated • 33 • 12
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-GPTQ • Text Generation • 7B • Updated • 14
ChenMnZ/Llama-3-70b-EfficientQAT-w2g128-GPTQ • Text Generation • 71B • Updated • 6
ChenMnZ/Llama-3-70b-EfficientQAT-w2g64-GPTQ • Text Generation • 71B • Updated • 7
ChenMnZ/Llama-3-70b-EfficientQAT-w4g128-GPTQ • Text Generation • 71B • Updated • 7
Xu-Ouyang/pythia-2.8b-deduped-int3-step43000-GPTQ-wikitext2 • Text Generation • 3B • Updated • 4
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g128-GPTQ • Text Generation • 71B • Updated • 6
Llamarider222/Mixtral_8x7B_GPTQ • Text Generation • 47B • Updated • 15
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w2g64-GPTQ • Text Generation • 71B • Updated • 7
ChenMnZ/Llama-2-7b-EfficientQAT-w2g128-BitBLAS • Text Generation • 26B • Updated • 5
ChenMnZ/Llama-3-70b-instruct-EfficientQAT-w4g128-GPTQ • Text Generation • 71B • Updated • 4
ChenMnZ/Llama-2-7b-EfficientQAT-w2g64-BitBLAS • Text Generation • 26B • Updated • 6
Xu-Ouyang/pythia-2.8b-deduped-int3-step57000-GPTQ-wikitext2 • Text Generation • 3B • Updated • 4
ChenMnZ/Llama-2-7b-EfficientQAT-w4g128-BitBLAS • Text Generation • 26B • Updated • 4