Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation Paper • 2604.10030 • Published 5 days ago • 12
VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization Paper • 2604.12887 • Published 1 day ago • 1
ELT: Elastic Looped Transformers for Visual Generation Paper • 2604.09168 • Published 6 days ago • 18
Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout Paper • 2511.20649 • Published Nov 25, 2025 • 51
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression Paper • 2512.05081 • Published Dec 4, 2025 • 33
Representation Alignment for Just Image Transformers is not Easier than You Think Paper • 2603.14366 • Published about 1 month ago • 13
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published 20 days ago • 155
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published 19 days ago • 62
Inference-time Physics Alignment of Video Generative Models with Latent World Models Paper • 2601.10553 • Published Jan 15 • 13
PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference Paper • 2603.25730 • Published 20 days ago • 52
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 30 days ago • 153
WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 29 days ago • 60
Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published Mar 3 • 145
Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling Paper • 2603.04553 • Published Mar 4 • 3