Inference Providers
Active filters: redhat
Text Generation
• 15B • Updated • 81
• 1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
Image-Text-to-Text
• 402B • Updated • 106
• 2
RedHatTraining/AI296-m3diterraneo-hotels
8B • Updated • 51
• 1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 676B • Updated • 623
• 13
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
Image-Text-to-Text
• 59B • Updated • 3.58k
• 1
Image-Text-to-Text
• 109B • Updated • 2
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 840
• 12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 924
RedHatAI/SmolLM3-3B-quantized.w4a16
0.9B • Updated • 43
• 1
Text-to-Image
• Updated • 1
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated • 38
• 4
RedHatAI/Devstral-Small-2507-quantized.w8a8
Text Generation
• 24B • Updated • 45
• 1
RedHatAI/Devstral-Small-2507-quantized.w4a16
Text Generation
• 4B • Updated • 16
• 2
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
• 1B • Updated • 7.52k
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
• 2B • Updated • 800
• 8
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
• 2B • Updated • 4.52k
• 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
• 1.0B • Updated • 23.4k
• 2
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
• 1B • Updated • 67.2k
• 28
RedHatAI/gpt-oss-20b-speculator.eagle3
Text Generation
• 0.9B • Updated • 72.5k
• 8
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
• 1B • Updated • 147
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16
Text Generation
• 1B • Updated • 5
RedHatAI/Qwen3-4B-Thinking-2507-quantized.w4a16
Text Generation
• 4B • Updated • 66
RedHatAI/Qwen3-4B-Instruct-2507-quantized.w4a16
Text Generation
• 4B • Updated • 314
RedHatAI/Qwen3-30B-A3B-Thinking-2507-quantized.w4a16
Text Generation
• 5B • Updated • 252
RedHatAI/Qwen3-30B-A3B-Instruct-2507-quantized.w4a16
Text Generation
• 5B • Updated • 1.63k
• 1
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Text Generation
• 12B • Updated • 846
• 3
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
• 0.5B • Updated • 949
• 2
RedHatAI/Qwen3-Next-80B-A3B-Thinking-quantized.w4a16
Text Generation
• Updated • 10
RedHatAI/Qwen3-235B-A22B-speculator.eagle3
1B • Updated • 18
• 1
RedHatAI/Qwen3-4B-Instruct-2507-quantized.w8a8
Text Generation
• 4B • Updated • 993