Active filters: llama-3
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
• 89B • Updated • 2.61k
• 134
hugging-quants/Llama-3.2-1B-Instruct-Q4_K_M-GGUF
Text Generation
• 1B • Updated • 34.9k
• 20
mlx-community/Llama-3.2-3B-Instruct-4bit
Text Generation
• 0.5B • Updated • 43.5k
• 43
lmstudio-community/Llama-3.2-3B-Instruct-GGUF
Text Generation
• 3B • Updated • 27.3k
• 43
unsloth/Llama-3.2-3B-Instruct
Text Generation
• 3B • Updated • 221k
• 88
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-Text-to-Text
• 11B • Updated • 8.32k
• 81
Text Generation
• 7B • Updated • 638
• 27
tjake/Llama-3.2-1B-Instruct-JQ4
Text Generation
• Updated • 1.51k
• 4
DavidAU/Maximizing-Model-Performance-All-Quants-Types-And-Full-Precision-by-Samplers_Parameters
Updated • 182
bartowski/Llama-3.3-70B-Instruct-GGUF
Text Generation
• 71B • Updated • 12.8k
• 73
mradermacher/Llama-3.3-70B-Instruct-abliterated-GGUF
71B • Updated • 685
• 5
DavidAU/L3-MOE-4x8B-Dark-Planet-Rebel-FURY-25B-GGUF
Text Generation
• 25B • Updated • 271
• 6
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF
Text Generation
• 8B • Updated • 42k
• 298
nvidia/Llama-3_3-Nemotron-Super-49B-v1
Text Generation
• 50B • Updated • 32.7k
• 322
nvidia/Llama-3.1-Nemotron-Nano-8B-v1
Text Generation
• 8B • Updated • 55k
• 222
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
Text Generation
• Updated • 6.98k
• 345
mlx-community/Llama-3_3-Nemotron-Super-49B-v1-mlx-4bit
Text Generation
• 8B • Updated • 429
• 2
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1-FP8
Text Generation
• 253B • Updated • 4.49k
• 12
nvidia/Llama-3.1-Nemotron-Nano-4B-v1.1
Text Generation
• 5B • Updated • 25.8k
• 114
nvidia/Llama-3_3-Nemotron-Super-49B-v1-FP8
Text Generation
• 50B • Updated • 4.53k
• 13
Repoaner/llama_guard_vision
Image-Text-to-Text
• 11B • Updated • 5
• 1
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8
Text Generation
• 50B • Updated • 56.2k
• 27
mradermacher/Llama-3.1-8B-Instruct-heretic-i1-GGUF
8B • Updated • 5.3k
• 3
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4
Text Generation
• 26B • Updated • 13.7k
• 17
JohnsonPedia/llama-3-8b-yoruba-chat-gguf
Text Generation
• 8B • Updated • 43
• 1
srswti/Llama-3.2-11B-Vision-Instruct-abliterated-4-bit
Image-to-Text
• 2B • Updated • 356
• 1
Mungert/Llama3.3-8B-Instruct-Thinking-Claude-4.5-Opus-High-Reasoning-GGUF
Text Generation
• 8B • Updated • 1.48k
• 2
ashishnair/Llama-Ione-8B-roleplay-v1
Text Generation
• 8B • Updated • 658
• 1
meta-llama/Llama-Guard-3-8B
Text Generation
• 8B • Updated • 132k
• 291
Text Generation
• 3B • Updated • 1M
• 754