KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance Paper • 2604.12627 • Published 4 days ago • 95
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe Paper • 2604.13016 • Published 4 days ago • 75
AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks Paper • 2604.01487 • Published 17 days ago • 10
SLEA-RL: Step-Level Experience Augmented Reinforcement Learning for Multi-Turn Agentic Training Paper • 2603.18079 • Published about 1 month ago • 1