GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published Feb 12 • 61
Think Dense, Not Long: Dynamic Decoupled Conditional Advantage for Efficient Reasoning Paper • 2602.02099 • Published Feb 2