Search-R1 Collection Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 17
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B Text Generation • 15B • Updated Feb 24, 2025 • 598k • • 624
unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF Text Generation • 31B • Updated Jan 30 • 141k • 583
Qwen/Qwen3-Coder-30B-A3B-Instruct Text Generation • 31B • Updated Dec 3, 2025 • 1.31M • • 1k
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21, 2025 • 83.1k • • 1.32k
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31, 2025 • 6.56k • 175