Self-Rewarding Sequential Monte Carlo for Masked Diffusion Language Models Paper • 2602.01849 • Published Feb 2 • 5
MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome Paper • 2603.28407 • Published 19 days ago • 68
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 11 days ago • 38
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving Paper • 2601.01426 • Published Jan 4 • 24
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction Paper • 2509.07403 • Published Sep 9, 2025 • 35
Electrocardiogram Instruction Tuning for Report Generation Paper • 2403.04945 • Published Mar 7, 2024 • 2
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models Paper • 2404.02657 • Published Apr 3, 2024 • 2
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers Paper • 2301.13741 • Published Jan 31, 2023 • 1
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models Paper • 2406.13035 • Published Jun 18, 2024 • 3
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper • 2407.13623 • Published Jul 18, 2024 • 56
PhyX: Does Your Model Have the "Wits" for Physical Reasoning? Paper • 2505.15929 • Published May 21, 2025 • 49
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs Paper • 2507.07562 • Published Jul 10, 2025 • 1
MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents Paper • 2508.21475 • Published Aug 29, 2025 • 2
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers Paper • 2305.17455 • Published May 27, 2023
UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective Paper • 2410.03090 • Published Oct 4, 2024 • 1
ATTS: Asynchronous Test-Time Scaling via Conformal Prediction Paper • 2509.15148 • Published Sep 18, 2025 • 1