CaRR & C-GRPO Collection Data and models for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards". • 6 items • Updated 27 days ago • 1
CohereLabs/cohere-transcribe-03-2026 Automatic Speech Recognition • Updated about 2 hours ago • 277k • 896