Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published 18 days ago • 46
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19, 2025 • 11.6k • 223
Towards Physically Plausible Video Generation via VLM Planning Paper • 2503.23368 • Published Mar 30, 2025 • 40
Towards Physically Plausible Video Generation via VLM Planning Paper • 2503.23368 • Published Mar 30, 2025 • 40