Inference Providers
Active filters: GRPO
alpha-ai/Reason-With-Choice-3B
Text Generation
• 3B • Updated • 6
mradermacher/Reason-With-Choice-3B-GGUF
3B • Updated • 52
mradermacher/Captain-Eris_Violet-GRPO-v0.420-GGUF
12B • Updated • 137
• 4
mradermacher/Captain-Eris_Violet-GRPO-v0.420-i1-GGUF
12B • Updated • 77
• 5
mradermacher/SmolLM2_135M_Grpo_Checkpoint-GGUF
0.1B • Updated • 25
Nitrals-Quants/Captain-Eris_Violet-GRPO-v0.420-4bpw-exl2
Text Generation
• Updated • 5
• 1
mradermacher/SmolLM2_135M_Grpo_Gsm8k-GGUF
0.1B • Updated • 21
mradermacher/SmolLM2_135M_Grpo_Gsm8k-i1-GGUF
0.1B • Updated • 111
mradermacher/PathFinderAI-S1-GGUF
33B • Updated • 140
mradermacher/SmolLM2_135M_Grpo_Checkpoint-i1-GGUF
0.1B • Updated • 65
mradermacher/PathFinderAI-S1-i1-GGUF
33B • Updated • 505
Nitral-Archive/Captain-Eris-BMO_Violent-GRPO-v0.420
Text Generation
• 12B • Updated • 45
• • 3
mradermacher/Captain-Eris-BMO_Violent-GRPO-v0.420-GGUF
12B • Updated • 34
• 1
Rivaidan/Captain-Eris_Violet-GRPO-v0.420-Q8_0-GGUF
12B • Updated bartowski/Nitral-AI_Captain-Eris_Violet-GRPO-v0.420-GGUF
Text Generation
• 12B • Updated • 3.44k
• 6
mradermacher/Captain-Eris-BMO_Violent-GRPO-v0.420-i1-GGUF
12B • Updated • 900
• 2
nharshavardhana/SmolGRPO-135M
Text Generation
• 0.1B • Updated • 2
TheMelonGod/Captain-Eris_Violet-GRPO-v0.420-exl2
Text Generation
• Updated • 7
Text Generation
• 0.1B • Updated • 2
Text Generation
• 0.1B • Updated • 4
Text Generation
• 0.5B • Updated • 2
kaweizhenpi/SmolGRPO-135M
Text Generation
• 0.1B • Updated • 2
Shumatsurontek/SmolGRPO-135M
Text Generation
• 0.1B • Updated • 2
stranger47/SmolLM2-1.7B-Instruct-Lora
Text Generation
• 2B • Updated • 4
• 1
TheMelonGod/Captain-Eris-BMO_Violent-GRPO-v0.420-exl2
Text Generation
• Updated • 4
Text Generation
• 0.1B • Updated • 6
• 1
Jarrodbarnes/Cortex-1-mini
Text Generation
• Updated • 6
• 2
Text Generation
• 0.1B • Updated • 5
Text Generation
• 0.1B • Updated • 1
stranger47/Qwen2.5-3B-Instruct-GRPO-NuminaMath-TIR
Text Generation
• 3B • Updated • 2