MARS: Enabling Autoregressive Models Multi-Token Generation • Paper • 2604.07023 • Published 11 days ago • 38
Demystifying When Pruning Works via Representation Hierarchies • Paper • 2603.24652 • Published 13 days ago • 20
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning • Paper • 2604.04746 • Published 11 days ago • 70
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression • Paper • 2604.04921 • Published 13 days ago • 107
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook • Paper • 2604.02029 • Published 17 days ago • 140
PRBench: End-to-end Paper Reproduction in Physics Research • Paper • 2603.27646 • Published 20 days ago • 29
On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models • Paper • 2603.27481 • Published 21 days ago • 35
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward • Paper • 2603.26599 • Published 22 days ago • 63
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization • Paper • 2603.19835 • Published 29 days ago • 338
LongCat-Next: Lexicalizing Modalities as Discrete Tokens • Paper • 2603.27538 • Published 20 days ago • 143
Emergent Social Intelligence Risks in Generative Multi-Agent Systems • Paper • 2603.27771 • Published 20 days ago • 52
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills • Paper • 2603.25158 • Published 23 days ago • 50
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models • Paper • 2603.23499 • Published 25 days ago • 51
Vega: Learning to Drive with Natural Language Instructions • Paper • 2603.25741 • Published 23 days ago • 6
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale • Paper • 2603.25040 • Published 23 days ago • 131
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation • Paper • 2603.22117 • Published 26 days ago • 29
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model • Paper • 2603.21986 • Published 26 days ago • 123
Alignment Makes Language Models Normative, Not Descriptive • Paper • 2603.17218 • Published Mar 17 • 46