HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 3 days ago • 61
Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw Paper • 2604.04759 • Published 12 days ago • 22
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published about 1 month ago • 138
Story-Adapter: A Training-free Iterative Framework for Long Story Visualization Paper • 2410.06244 • Published Oct 8, 2024 • 20
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation Paper • 2601.15369 • Published Jan 21 • 21
When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought Paper • 2511.02779 • Published Nov 4, 2025 • 60
MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs Paper • 2504.06897 • Published Apr 9, 2025 • 1
Discrete Diffusion Models with MLLMs for Unified Medical Multimodal Generation Paper • 2510.06131 • Published Oct 7, 2025 • 11
POSTER++: A simpler and stronger facial expression recognition network Paper • 2301.12149 • Published Jan 28, 2023 • 1