Rethinking Diverse Human Preference Learning through Principal Component Analysis Paper • 2502.13131 • Published Feb 18, 2025 • 37
MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning Paper • 2505.24846 • Published May 30, 2025 • 15
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay Paper • 2506.05316 • Published Jun 5, 2025 • 1
rlvr-weak-supervision Collection Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated about 18 hours ago • 1
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Paper • 2512.03244 • Published Dec 2, 2025 • 17
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents Paper • 2504.13203 • Published Apr 15, 2025 • 35
MOSAIC: Modeling Social AI for Content Dissemination and Regulation in Multi-Agent Simulations Paper • 2504.07830 • Published Apr 10, 2025 • 18
Generalization in Healthcare AI: Evaluation of a Clinical Large Language Model Paper • 2402.10965 • Published Feb 14, 2024 • 1
Understanding Disparities in Post Hoc Machine Learning Explanation Paper • 2401.14539 • Published Jan 25, 2024