-
-
-
-
-
-
Inference Providers
Active filters:
vLLM
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Lite
Text Generation
•
721B
•
Updated
•
11
•
1
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Compact
Text Generation
•
847B
•
Updated
•
8
•
5
QuantTrio/DeepSeek-R1-0528-GPTQ-Int4-Int8Mix-Medium
Text Generation
•
912B
•
Updated
•
43
•
1
brandonbeiler/InternVL3-38B-FP8-Dynamic
Image-Text-to-Text
•
38B
•
Updated
•
62
brandonbeiler/InternVL3-78B-FP8-Dynamic
Image-Text-to-Text
•
78B
•
Updated
•
71
brandonbeiler/InternVL3-8B-FP8-Dynamic
Image-Text-to-Text
•
8B
•
Updated
•
11
•
2
dengcao/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Image-Text-to-Text
•
15B
•
Updated
•
31
•
2
dengcao/GLM-4.1V-9B-Thinking-AWQ
Image-Text-to-Text
•
10B
•
Updated
•
279k
•
1
brandonbeiler/Skywork-R1V3-38B-FP8-Dynamic
Image-Text-to-Text
•
38B
•
Updated
•
10
•
1
koushd/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
•
235B
•
Updated
•
65
•
4
QuantTrio/Qwen3-235B-A22B-Instruct-2507-GPTQ-Int4-Int8Mix
Text Generation
•
248B
•
Updated
•
317
•
2
QuantTrio/Qwen3-235B-A22B-Instruct-2507-AWQ
Text Generation
•
235B
•
Updated
•
2.76k
•
10
QuantTrio/GLM-4.1V-9B-Thinking-GPTQ-Int4-Int8Mix
Text Generation
•
15B
•
Updated
•
5
•
1
QuantTrio/GLM-4.1V-9B-Thinking-AWQ
Text Generation
•
10B
•
Updated
•
245
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-AWQ
Text Generation
•
480B
•
Updated
•
515
•
8
QuantTrio/Qwen3-Coder-480B-A35B-Instruct-GPTQ-Int4-Int8Mix
Text Generation
•
534B
•
Updated
•
184
•
6
QuantTrio/Qwen3-235B-A22B-Thinking-2507-AWQ
Text Generation
•
235B
•
Updated
•
1.71k
•
5
QuantTrio/Qwen3-235B-A22B-Thinking-2507-GPTQ-Int4-Int8Mix
Text Generation
•
253B
•
Updated
•
128
•
2
QuantTrio/GLM-4.5-Air-AWQ-FP16Mix
Text Generation
•
24B
•
Updated
•
3.16k
•
14
QuantTrio/GLM-4.5-Air-GPTQ-Int4-Int8Mix
Text Generation
•
20B
•
Updated
•
2.76k
•
10
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
•
31B
•
Updated
•
899
•
9
QuantTrio/GLM-4.5-GPTQ-Int4-Int8Mix
Text Generation
•
55B
•
Updated
•
5
•
5
Text Generation
•
53B
•
Updated
•
24
•
9
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation
•
14B
•
Updated
•
132
•
4
QuantTrio/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
•
31B
•
Updated
•
721
•
2
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation
•
31B
•
Updated
•
84.9k
•
2
QuantTrio/KAT-V1-40B-GPTQ-Int4-Int8Mix
Text Generation
•
47B
•
Updated
•
16
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
•
31B
•
Updated
•
2.47k
•
6
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-AWQ
Text Generation
•
31B
•
Updated
•
175k
•
5
EliovpAI/Qwen3-14B-FP8-KV
Text Generation
•
15B
•
Updated
•
13
•
2