Text-to-videos
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Paper
• 2402.03162
• Published • 20
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Paper
• 2402.03040
• Published • 19
Magic-Me: Identity-Specific Video Customized Diffusion
Paper
• 2402.09368
• Published • 31
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing
Paper
• 2402.10294
• Published • 27
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Paper
• 2402.14797
• Published • 21
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper
• 2402.17177
• Published • 87
FIFO-Diffusion: Generating Infinite Videos from Text without Training
Paper
• 2405.11473
• Published • 56
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning
Paper
• 2405.18386
• Published • 22