Inference Providers
Active filters: instruct
bartowski/Palmyra-Med-70B-32K-GGUF
Text Generation
• 71B • Updated • 175
• 4
mlx-community/Hermes-2-Pro-Mistral-7B-3bit
0.9B • Updated • 26
• 1
DavidAU/Llama3.2-DeepHermes-3-3B-Preview-Reasoning-MAX-NEO-Imatrix-GGUF
Text Generation
• 3B • Updated • 1.23k
• 4
DavidAU/Reka-Flash-3-21B-Reasoning-Uncensored-MAX-NEO-Imatrix-GGUF
Text Generation
• 21B • Updated • 1.13k
• 57
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated • 37
• 16
Text Generation
• Updated • 7
• 1
NousResearch/Hermes-4-14B-FP8
Text Generation
• 15B • Updated • 2.82k
• 19
gabriellarson/Hermes-4-14B-GGUF
15B • Updated • 394
• 8
mradermacher/MamayLM-Gemma-3-4B-IT-v1.0-i1-GGUF
4B • Updated • 673
• 3
yaraabdalaziz/SmolLM2Instruct-FT-MealRecommendation-v3
Text Generation
• 0.1B • Updated • 3
• 1
NousResearch/Hermes-4.3-36B-GGUF
Text Generation
• 36B • Updated • 5.31k
• 44
ai-sage/GigaChat3.1-702B-A36B
Text Generation
• 715B • Updated • 4.14k
• 25
VillanovaAI/Villanova-2B-2603
Text Generation
• 2B • Updated • 2.1k
• 6
EducatingAI/Mistral-Trismegistus-7B
Updated • 17
• 1
mradermacher/Mistral-Trismegistus-7B-GGUF
7B • Updated • 446
• 1
mradermacher/Mistral-Trismegistus-7B-i1-GGUF
7B • Updated • 972
• 1
mradermacher/GRM-2.5-i1-GGUF
4B • Updated • 6.76k
• 1
HyzeAI/HyzeQwenInstruct-Q4_K_M-GGUF
Image-Text-to-Text
• 31B • Updated • 180
• 1
ValiantLabs/gemma-4-E2B-it-ShiningValiant3
Image-Text-to-Text
• 5B • Updated • 12
• 1
pthinc/Cicikus_v4_0.3B_Pitircik
Text Generation
• 0.3B • Updated • 4.96k
• 1
AEON-7/Gemma-4-31B-it-DECKARD-HERETIC-Uncensored-NVFP4
Text Generation
• 18B • Updated • 725
• 1
pbhappliedsystems/qwen-2.5-7B-instruct-gguf-Q4-K-M
8B • Updated • 936
• 1
ValiantLabs/gemma-4-31B-it-Guardpoint
Image-Text-to-Text
• 31B • Updated • 12
• 1
majentik/gemma-4-26B-A4B-it-RotorQuant
Image-Text-to-Text
• Updated • 1
majentik/gemma-4-31B-it-RotorQuant-GGUF-Q8_0
Image-Text-to-Text
• 31B • Updated • 83
• 1
dwojcik/gemma4-26b-a4b-it-codex-gguf-4bit
25B • Updated • 1.18k
• 1
majentik/Qwen3.6-35B-A3B-RotorQuant-GGUF-Q3_K_M
Image-Text-to-Text
• 35B • Updated • 1
majentik/Qwen3.6-35B-A3B-TurboQuant
Image-Text-to-Text
• Updated • 1
NousResearch/Hermes-3-Llama-3.1-8B-GGUF
8B • Updated • 10.1k
• 143
Manojb/Qwen3-4b-toolcall-gguf-llamacpp-codex
Text Generation
• 4B • Updated • 801
• 7