ThinkTwice: Jointly Optimizing Large Language Models for Reasoning and Self-Refinement Paper • 2604.01591 • Published 13 days ago • 40
rubricreward/mR3-Qwen3-14B-tgt-prompt-tgt-thinking-translated Text Generation • 15B • Updated Oct 2, 2025 • 7