OpenWorldLib: A Unified Codebase and Definition of Advanced World Models Paper • 2604.04707 • Published 10 days ago • 200
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models Paper • 2603.17051 • Published 29 days ago • 109
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 29 days ago • 60
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier Paper • 2603.03756 • Published Mar 4 • 89
Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published Feb 27 • 41
Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published Feb 25 • 30
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published Jan 6 • 176
StoryMem: Multi-shot Long Video Storytelling with Memory Paper • 2512.19539 • Published Dec 22, 2025 • 19
In-Video Instructions: Visual Signals as Generative Control Paper • 2511.19401 • Published Nov 24, 2025 • 32
Video-As-Prompt: Unified Semantic Control for Video Generation Paper • 2510.20888 • Published Oct 23, 2025 • 50
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published Oct 21, 2025 • 41
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling Paper • 2510.09212 • Published Oct 10, 2025 • 18
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9, 2025 • 127