Zero-shot World Models Are Developmentally Efficient Learners Paper • 2604.10333 • Published 8 days ago • 7
Dynamics as Prompts: In-Context Learning for Sim-to-Real System Identifications Paper • 2410.20357 • Published Oct 27, 2024
Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment Paper • 2602.12281 • Published Feb 12 • 1
SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation Paper • 2602.16863 • Published Feb 18 • 18
Latent Adversarial Regularization for Offline Preference Optimization Paper • 2601.22083 • Published Jan 29 • 14
Spherical Leech Quantization for Visual Tokenization and Generation Paper • 2512.14697 • Published Dec 16, 2025 • 8
Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization Paper • 2510.05038 • Published Oct 6, 2025 • 1
Personalized Preference Fine-tuning of Diffusion Models Paper • 2501.06655 • Published Jan 11, 2025 • 1
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Paper • 2502.17387 • Published Feb 24, 2025 • 7
RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems Paper • 2510.02263 • Published Oct 2, 2025 • 9
World Modeling with Probabilistic Structure Integration Paper • 2509.09737 • Published Sep 10, 2025 • 14