HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 5 days ago • 151
Combee: Scaling Prompt Learning for Self-Improving Language Model Agents Paper • 2604.04247 • Published 8 days ago • 25
MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale Paper • 2604.04771 • Published 7 days ago • 114
Video Generation Models as World Models: Efficient Paradigms, Architectures and Algorithms Paper • 2603.28489 • Published 13 days ago • 30
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models Paper • 2603.23499 • Published 19 days ago • 51
Jackrong/Qwen3.5-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 36B • Updated 7 days ago • 40.6k • 116
MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation Paper • 2603.16861 • Published 26 days ago • 9
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas Paper • 2603.16448 • Published 26 days ago • 58
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding Paper • 2603.19235 • Published 24 days ago • 95
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published Mar 6 • 118
The Trinity of Consistency as a Defining Principle for General World Models Paper • 2602.23152 • Published Feb 26 • 201
Running on Zero Featured 17 Qwen3 VL Video Grounding 🥠 17 Text-guided object tracking, point tracking, reasoning.