Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published Jul 14, 2025 • 73
ASA: Training-Free Representation Engineering for Tool-Calling Agents Paper • 2602.04935 • Published Feb 4 • 42
Query as Anchor: Scenario-Adaptive User Representation via Large Language Model Paper • 2602.14492 • Published Feb 16 • 18
Thinking with Drafting: Optical Decompression via Logical Reconstruction Paper • 2602.11731 • Published Feb 12 • 34
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Paper • 2602.03773 • Published Feb 3 • 13
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published Feb 11 • 31
Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling Paper • 2602.02453 • Published Feb 2 • 36
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale Paper • 2602.05711 • Published Feb 5 • 12
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning Paper • 2602.07075 • Published Feb 6 • 19
SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks Paper • 2602.06854 • Published Feb 6 • 6
LatentMem: Customizing Latent Memory for Multi-Agent Systems Paper • 2602.03036 • Published Feb 3 • 15
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published Jan 29 • 102
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published Nov 11, 2025 • 110
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing Paper • 2602.02159 • Published Feb 2 • 1