I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models Paper • 2312.16693 • Published Dec 27, 2023 • 14
VideoTetris: Towards Compositional Text-to-Video Generation Paper • 2406.04277 • Published Jun 6, 2024 • 25
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published Feb 27 • 59
Running Agents RBench Leaderboard 🦾 View and submit robot image‑to‑video model benchmark results
Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published Jan 21 • 45
Rethinking Video Generation Model for the Embodied World Paper • 2601.15282 • Published Jan 21 • 45
Running Agents RBench Leaderboard 🦾 View and submit robot image‑to‑video model benchmark results
Running Agents RBench Leaderboard 🦾 View and submit robot image‑to‑video model benchmark results
Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models Paper • 2601.07287 • Published Jan 12 • 6
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance Paper • 2503.10391 • Published Mar 13, 2025 • 12
MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation Paper • 2503.14428 • Published Mar 18, 2025 • 8