SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing Paper โข 2604.04911 โข Published 11 days ago โข 35
ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts Paper โข 2507.20939 โข Published Jul 28, 2025 โข 57
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO Paper โข 2505.13031 โข Published May 19, 2025 โข 4