Inference Providers
Active filters: 8-bit
mlx-community/Qwen3-Coder-Next-8bit
Text Generation
• 80B • Updated • 5.2k
• 13
osoleve/Qwen3.5-27B-Text-NVFP4-MTP
Text Generation
• 17B • Updated • 12.3k
• 17
cosmicproc/Qwen3.5-4B-NVFP4
Image-Text-to-Text
• 3B • Updated • 1.14k
• 2
0.3B • Updated • 184
• 4
caiovicentino1/Gemma-4-26B-A4B-it-HLWQ-Q5
Image-Text-to-Text
• 27B • Updated • 360
• 7
RedHatAI/gemma-4-26B-A4B-it-NVFP4
15B • Updated • 45.7k
• 15
DJLougen/Ornstein-26B-A4B-it-FP8
Image-Text-to-Text
• 26B • Updated • 83
• 4
OptimizeLLM/Qwen3.5-122B-A10B-heretic-MTP-NVFP4
Text Generation
• 74B • Updated • 962
• 3
djdeniro/Qwen3.5-397B-A17B-MXFP4
Image-Text-to-Text
• 215B • Updated • 53
• 2
olka-fi/MiniMax-M2.7-MXFP4
Text Generation
• 123B • Updated • 613
• 2
majentik/gemma-4-31B-it-TurboQuant-MLX-8bit
Image-Text-to-Text
• 9B • Updated • 468
• 2
zecanard/Qwen3.5-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-MLX-8bit-int8-affine
Image-Text-to-Text
• 13B • Updated • 235
• 2
NinjaBoffin/MiniMax-M2.7-NVFP4
Text Generation
• 116B • Updated • 272
• 2
kernelpool/gemma-4-E4B-it-OBLITERATED-8bit
Text Generation
• 8B • Updated • 1.01k
• 2
sakamakismile/SuperGemma4-26B-Abliterated-Multimodal-NVFP4
Image-Text-to-Text
• 15B • Updated • 154
• 2
arthurcollet/Qwen3.6-35B-A3B-mlx-mxfp8
Image-Text-to-Text
• 10B • Updated • 4.38k
• 2
bearzi/Qwen3.6-35B-A3B-oQ8
Text Generation
• 10B • Updated • 913
• 2
Youssofal/Qwen3.6-35B-A3B-Abliterated-Heretic-MLX-8bit
Text Generation
• Updated • 1.48k
• 3
deepsweet/Qwen3.6-35B-A3B-MLX-oQ8
Text Generation
• 10B • Updated • 1.09k
• 2
saricles/MiniMax-M2.7-NVFP4-GB10-AC
Text Generation
• 119B • Updated • 2
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF
Text Generation
• 7B • Updated • 117k
• 136
MaziyarPanahi/Phi-3.5-mini-instruct-GGUF
Text Generation
• 4B • Updated • 412k
• 28
MaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF
Text Generation
• 1B • Updated • 90.8k
• 19
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
• 3B • Updated • 2.29k
• 211
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation
• 3B • Updated • 89.9k
• 15
mlx-community/Qwen2.5.1-Coder-7B-Instruct-8bit
Text Generation
• Updated • 156
• 3
lmstudio-community/Qwen2.5-Coder-14B-Instruct-MLX-8bit
Text Generation
• 4B • Updated • 107k
• 2
mlx-community/DeepSeek-R1-Distill-Llama-8B-8bit
2B • Updated • 137
• 4
mlx-community/Qwen2.5-14B-Instruct-1M-8bit
Text Generation
• 4B • Updated • 232
• 11
MaziyarPanahi/Phi-4-mini-instruct-GGUF
Text Generation
• 4B • Updated • 96.2k
• 12