Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation
Paper • 2604.10030 • Published • 14
Computer Vision and Deep Learning
MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction
Bridging Semantic and Kinematic Conditions with Diffusion-based Discrete Motion Tokenizer