Article CircleGuardBench: New Standard for Evaluating AI Moderation Models May 7, 2025 • 60
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 27 days ago • 153
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 124
togethercomputer/CoderForge-Preview-32B-SWE-Bench-Verified-Evaluation-trajectories Viewer • Updated Feb 2 • 500 • 79 • 13
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 5 days ago • 17
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 53
Robust Speech Recognition via Large-Scale Weak Supervision Paper • 2212.04356 • Published Dec 6, 2022 • 53