Grounding Everything in Tokens for Multimodal Large Language Models Paper • 2512.10554 • Published Dec 11, 2025 • 1
On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation Paper • 2603.22117 • Published 25 days ago • 29
Article Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments Jan 20 • 12
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published Mar 17 • 109
RubricBench: Aligning Model-Generated Rubrics with Human Standards Paper • 2603.01562 • Published Mar 2 • 63
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Paper • 2602.22766 • Published Feb 26 • 44
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published Feb 25 • 50
Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments Feb 12 • 32
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 195
Towards Pixel-Level VLM Perception via Simple Points Prediction Paper • 2601.19228 • Published Jan 27 • 18
Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 71
VISTA-PATH: An interactive foundation model for pathology image segmentation and quantitative analysis in computational pathology Paper • 2601.16451 • Published Jan 23 • 3
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Paper • 2601.14243 • Published Jan 20 • 23
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published Jan 15 • 31
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Paper • 2601.02256 • Published Jan 5 • 33
Enhancing Inflation Nowcasting with LLM: Sentiment Analysis on News Paper • 2410.20198 • Published Oct 26, 2024 • 1