RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic • Image-Text-to-Text • 109B • Updated Sep 22, 2025 • 32.4k • 28
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16 • Image-Text-to-Text • 20B • Updated Sep 22, 2025 • 25.7k • 12
RedHatAI/Llama-4-Maverick-17B-128E-Instruct • Image-Text-to-Text • 402B • Updated Sep 22, 2025 • 30 • 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8 • Image-Text-to-Text • 402B • Updated Sep 22, 2025 • 181 • 2
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic • Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 1.32k • 9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 • Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 409 • 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 • Image-Text-to-Text • 5B • Updated Oct 29, 2025 • 3.06k • 10
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503 • Image-Text-to-Text • 24B • Updated Sep 22, 2025 • 59 • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic • Text Generation • 24B • Updated Oct 29, 2025 • 897 • 13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8 • Text Generation • 24B • Updated Oct 29, 2025 • 18.8k • 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic • Text Generation • 71B • Updated Dec 12, 2025 • 35.2k • 15
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • Updated Oct 23, 2025 • 204 • 15
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 • Text Generation • 71B • Updated Sep 22, 2025 • 3.44k • 13
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 • Text Generation • 11B • Updated Sep 22, 2025 • 2.47k • 3
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16 • Text Generation • 1B • Updated Sep 22, 2025 • 1.29k • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 • Text Generation • 4B • Updated Oct 29, 2025 • 4.47k • 1
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated Sep 22, 2025 • 7.76k • 20