RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning Paper • 2603.09160 • Published Mar 10 • 15
Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation Paper • 2506.10403 • Published Jun 12, 2025 • 1
R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training Paper • 2505.00358 • Published May 1, 2025 • 26
Evaluating Sample Utility for Data Selection by Mimicking Model Weights Paper • 2501.06708 • Published Jan 12, 2025 • 5