V-GRPO: Online Reinforcement Learning for Denoising Generative Models Is Easier than You Think Paper • 2604.23380 • Published 13 days ago • 4