Inference Providers
Active filters: redhat
RedHatAI/Qwen3-8B-quantized.w4a16
Text Generation
• 2B • Updated • 11k
• 3
RedHatAI/Qwen3-30B-A3B-quantized.w4a16
Text Generation
• 5B • Updated • 2.5k
• 7
BCCard/Qwen3-32B-FP8-Dynamic
Text Generation
• 33B • Updated • 5
• 1
BCCard/Qwen3-30B-A3B-FP8-Dynamic
Text Generation
• 31B • Updated • 25.7k
Text Generation
• 15B • Updated • 77
• 1
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8
Image-Text-to-Text
• 402B • Updated • 181
• 2
RedHatTraining/AI296-m3diterraneo-hotels
8B • Updated • 42
• 1
RedHatAI/DeepSeek-R1-0528-quantized.w4a16
Text Generation
• 104B • Updated • 755
• 13
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-quantized.w4a16
Image-Text-to-Text
• 59B • Updated • 369
• 1
Image-Text-to-Text
• 109B • Updated • 3
RedHatAI/Kimi-K2-Instruct-quantized.w4a16
Text Generation
• 1T • Updated • 571
• 12
nm-testing/Llama-3.1-8B-Instruct-speculator.eagle3-converted
Text Generation
• 1.0B • Updated • 227
RedHatAI/SmolLM3-3B-quantized.w4a16
0.9B • Updated • 25
• 1
Text-to-Image
• Updated • 6
RedHatAI/Devstral-Small-2507-FP8-Dynamic
Text Generation
• 24B • Updated • 30
• 4
RedHatAI/Devstral-Small-2507-quantized.w8a8
Text Generation
• 24B • Updated • 91
• 1
RedHatAI/Devstral-Small-2507-quantized.w4a16
Text Generation
• 4B • Updated • 24
• 2
RedHatAI/Qwen3-14B-speculator.eagle3
Text Generation
• 1B • Updated • 6.04k
RedHatAI/Qwen3-32B-speculator.eagle3
Text Generation
• 2B • Updated • 883
• 8
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3
Text Generation
• 2B • Updated • 3.02k
• 1
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3
Text Generation
• 1.0B • Updated • 24.3k
• 2
RedHatAI/Qwen3-8B-speculator.eagle3
Text Generation
• 1B • Updated • 71.7k
• 28
RedHatAI/Qwen3-235B-A22B-Instruct-2507-speculator.eagle3
Text Generation
• 1B • Updated • 810
ChibuUkachi/Qwen3-4B-Instruct-2507.w4a16
Text Generation
• 1B • Updated • 5
RedHatAI/Qwen3-4B-Thinking-2507-quantized.w4a16
Text Generation
• 4B • Updated • 247
RedHatAI/Qwen3-4B-Instruct-2507-quantized.w4a16
Text Generation
• 4B • Updated • 158
RedHatAI/Qwen3-30B-A3B-Thinking-2507-quantized.w4a16
Text Generation
• 5B • Updated • 81
RedHatAI/Qwen3-30B-A3B-Instruct-2507-quantized.w4a16
Text Generation
• 5B • Updated • 1.47k
• 1
RedHatAI/Qwen3-Next-80B-A3B-Instruct-quantized.w4a16
Text Generation
• 12B • Updated • 319
• 3
RedHatAI/Qwen3-30B-A3B-Instruct-2507-speculator.eagle3
Text Generation
• 0.5B • Updated • 794
• 2