Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published 19 days ago • 46
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 19 days ago • 57
Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World Paper • 2603.12746 • Published Mar 13 • 1
IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering Paper • 2506.23329 • Published Jun 29, 2025 • 8
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models Paper • 2511.00503 • Published Nov 1, 2025 • 2
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Paper • 2512.03000 • Published Dec 2, 2025 • 37
JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization Paper • 2511.23002 • Published Nov 28, 2025 • 26
Post: JarvisArt has been featured in this week's “Space of the Week.” We welcome anyone interested to try it out! LYL1015/JarvisArt-Preview