Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2 Image-Text-to-Text • 28B • Updated 11 days ago • 350k • 112
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 191