TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs' Social Intelligence • Paper 2505.24500 • Published May 30, 2025
Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining • Paper 2504.07912 • Published Apr 10, 2025