A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Paper
• 2510.23587
• Published • 67
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to
Embodied AI
Paper
• 2510.05684
• Published • 146
Thinking with Camera: A Unified Multimodal Model for Camera-Centric
Understanding and Generation
Paper
• 2510.08673
• Published • 127
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Paper
• 2511.08892
• Published • 216
Table-R1: Inference-Time Scaling for Table Reasoning
Paper
• 2505.23621
• Published • 93
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
Paper
• 2511.15705
• Published • 98
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
Paper
• 2512.04746
• Published • 14
UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving
Paper
• 2512.09864
• Published • 12
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
Paper
• 2512.07461
• Published • 79
N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
Paper
• 2512.16561
• Published • 20
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience
Paper
• 2512.17260
• Published • 52
LongVideoAgent: Multi-Agent Reasoning with Long Videos
Paper
• 2512.20618
• Published • 56
Step-DeepResearch Technical Report
Paper
• 2512.20491
• Published • 87
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale
Paper
• 2512.10398
• Published • 13