SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding Paper • 2604.09557 • Published Feb 10 • 10
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models Paper • 2604.09459 • Published 3 days ago • 11
Mobile GUI Agent Privacy Personalization with Trajectory Induced Preference Optimization Paper • 2604.11259 • Published 3 days ago • 11
Agentic Aggregation for Parallel Scaling of Long-Horizon Agentic Tasks Paper • 2604.11753 • Published 3 days ago • 13
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators Paper • 2604.11805 • Published 3 days ago • 15
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs Paper • 2604.10480 • Published 4 days ago • 17
CocoaBench: Evaluating Unified Digital Agents in the Wild Paper • 2604.11201 • Published 3 days ago • 32
Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models Paper • 2604.10949 • Published 3 days ago • 38
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation Paper • 2604.09132 • Published 6 days ago • 48
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation Paper • 2604.10098 • Published 5 days ago • 69
The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Paper • 2604.11297 • Published 3 days ago • 92
Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling Paper • 2604.04987 • Published 11 days ago • 3
Initialisation Determines the Basin: Efficient Codebook Optimisation for Extreme LLM Quantization Paper • 2604.08118 • Published 7 days ago • 2
MixFlow: Mixed Source Distributions Improve Rectified Flows Paper • 2604.09181 • Published 6 days ago • 3
On Semiotic-Grounded Interpretive Evaluation of Generative Art Paper • 2604.08641 • Published 7 days ago • 4