R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning Paper • 2601.19620 • Published Jan 27 • 2
REVISOR: Beyond Textual Reflection, Towards Multimodal Introspective Reasoning in Long-Form Video Understanding Paper • 2511.13026 • Published Nov 17, 2025 • 26