Active filters: sglang
Jackrong/Qwopus3.5-27B-v3-FP8-vllm-ready
Text Generation
• 27B • Updated • 13.8k
• 17
AxionML/Qwen3.5-27B-NVFP4
Image-Text-to-Text
• 17B • Updated • 12k
• 10
thoughtworks/MiniMax-M2.5-Eagle3
Text Generation
• 0.2B • Updated • 1.72k
• 3
Image-Text-to-Text
• 125B • Updated • 42
• 2
NinjaBoffin/MiniMax-M2.7-NVFP4
Text Generation
• 116B • Updated • 664
• 2
alvarobartt/grok-2-tokenizer
Text Generation
• Updated • 56
• 5
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 75.7k
• 25
AxionML/Qwen3.5-35B-A3B-NVFP4
Image-Text-to-Text
• Updated • 9.44k
• 5
Jetlink/JetLLMPremium-3.5
Image-Text-to-Text
• 403B • Updated • 28
• 1
Image-Text-to-Text
• 33B • Updated • 48
• 1
thoughtworks/GLM-4.7-FP8-Eagle3
Text Generation
• 0.6B • Updated • 27
• 1
thoughtworks/Qwen3-Coder-Next-Eagle3
Text Generation
• 0.1B • Updated • 480
• 1
dervig/m51Lab-MiniMax-M2.7-REAP-139B-A10B-NVFP4-GB10
Text Generation
• 79B • Updated • 894
• 1
scottgl/Qwen3.5-122B-A10B-NVFP4-GB10
Text Generation
• 27B • Updated • 2.74k
• 2
scottgl/MiniMax-M2.7-REAP-172B-A10B-NVFP4-GB10
Text Generation
• Updated • 562
• 2
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text
• 8B • Updated • 32
• 9
SurfaceData/llava-v1.6-vicuna-7b-sglang
Image-Text-to-Text
• 7B • Updated • 40
• 1
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 90
• 2
tclf90/qwen2.5-72b-instruct-gptq-int3
Text Generation
• 69B • Updated • 66
mradermacher/MiniMax-M2-THRIFT-GGUF
173B • Updated • 2.66k
• 35
JasmineBBB/Kimi-Linear-48B-A3B-Instruct-bnb-4bit
Text Generation
• 49B • Updated • 16
• 1
mradermacher/MiniMax-M2-THRIFT-i1-GGUF
173B • Updated • 169
• 10
bartowski/VibeStudio_MiniMax-M2-THRIFT-GGUF
Text Generation
• 173B • Updated • 458
• 8
osmapi/MiniMax-M2-THRIFT-55
106B • Updated • 247
• 5
JinnP/SGLang-EAGLE3-Qwen3-Coder-30B-A3B-Instruct
Text Generation
• 0.2B • Updated • 73
• 1
mradermacher/MiniMax-M2-THRIFT-55-GGUF
106B • Updated • 145
• 2
mradermacher/MiniMax-M2-THRIFT-55-i1-GGUF
106B • Updated • 1.78k
• 2
osmapi/MiniMax-M2-THRIFT-55-MLX-4bit
106B • Updated • 78
• 2