Active filters: code
lmstudio-community/Phi-3.5-MoE-instruct-GGUF
Text Generation
• 42B • Updated • 140
• 5
inarikami/DeepSeek-R1-Distill-Qwen-32B-AWQ
Text Generation
• 33B • Updated • 412
• 12
ggml-org/Qwen2.5-Coder-0.5B-Q8_0-GGUF
Text Generation
• 0.5B • Updated • 109k
• 8
mradermacher/salamandra-2b-instruct-GGUF
2B • Updated • 446
• 2
unsloth/Phi-4-mini-instruct-GGUF
Text Generation
• 4B • Updated • 26.9k
• 93
JetBrains/Mellum-4b-sft-python
Text Generation
• 4B • Updated • 154
• 56
JetBrains/Mellum-4b-base-gguf
4B • Updated • 207
• 39
unsloth/Phi-4-mini-reasoning-GGUF
Text Generation
• 4B • Updated • 6.12k
• 66
mlx-community/Phi-4-mini-reasoning-6bit
Text Generation
• Updated • 24
• 1
mradermacher/Phi-4-mini-reasoning-i1-GGUF
4B • Updated • 642
• 1
ArtusDev/PocketDoc_Dans-PersonalityEngine-V1.3.0-24b_EXL3_4.0bpw_H6
Text Generation
• 6B • Updated • 27
• 3
Text Generation
• 73B • Updated • 2.27k
• 384
zeroentropy/zerank-1-small
Text Ranking
• 2B • Updated • 15.7k
• 62
nvidia/OpenReasoning-Nemotron-1.5B
Text Generation
• 2B • Updated • 14.1k
• 55
mradermacher/zerank-1-small-GGUF
2B • Updated • 367
• 2
ByteDance-Seed/cudaLLM-8B
Text Generation
• 8B • Updated • 93
• 28
DavidAU/Openai_gpt-oss-20b-CODER-NEO-CODE-DI-MATRIX-GGUF
Text Generation
• 21B • Updated • 921
• 14
DavidAU/OpenAi-GPT-oss-20b-abliterated-uncensored-NEO-Imatrix-gguf
Text Generation
• 21B • Updated • 50.1k
• 481
mradermacher/Qwen3-MOE-4x0.6B-2.4B-Writing-Thunder-V1.2-GGUF
2B • Updated • 264
• 2
jamescallander/Qwen2.5-Coder-3B-Instruct_w8a8_g128_rk3588.rkllm
Text Generation
• Updated • 80
• 2
AQ-MedAI/Diver-Retriever-4B-1020
Text Ranking
• 4B • Updated • 2.99k
• 4
Phariadata/granite-docling-258M-untied
Text Generation
• 0.3B • Updated • 17
• 1
Text Generation
• 2B • Updated • 1.32k
• 516
Image-to-3D
• Updated • 14
• 10
Text Generation
• 15B • Updated • 140
• 9
OpenMOSS-Team/Qwen3-32B-ABC
Text Generation
• 33B • Updated • 26
• 3
janhq/Jan-v3-4B-base-instruct
Text Generation
• 4B • Updated • 247
• 61
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 90.3k
• 24
SamsungSAILMontreal/Qwen3-Coder-Next-REAP
Text Generation
• 60B • Updated • 347
• 3
h2loop-ai/spark-cpt-base-ckpt
Text Generation
• 7B • Updated • 1