Collections
Discover the best community collections!
Collections including paper arxiv:2512.02014
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 208 • 99 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 39 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Perception-Aware Policy Optimization for Multimodal Reasoning
Paper • 2507.06448 • Published • 48 -
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
Paper • 2507.05920 • Published • 12 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 218 -
Latent Chain-of-Thought for Visual Reasoning
Paper • 2510.23925 • Published • 10
-
Perception-Aware Policy Optimization for Multimodal Reasoning
Paper • 2507.06448 • Published • 48 -
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning
Paper • 2507.05920 • Published • 12 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 218 -
Latent Chain-of-Thought for Visual Reasoning
Paper • 2510.23925 • Published • 10
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 208 • 99 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 39 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88