-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 54 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 15 -
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 67 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper • 2504.13180 • Published • 20
Collections
Discover the best community collections!
Collections including paper arxiv:2506.04178
-
Large Reasoning Models Learn Better Alignment from Flawed Thinking
Paper • 2510.00938 • Published • 60 -
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Paper • 2509.19284 • Published • 23 -
Learning to Reason as Action Abstractions with Scalable Mid-Training RL
Paper • 2509.25810 • Published • 6 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 277
-
Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated
Paper • 2509.05739 • Published • 2 -
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers
Paper • 2509.03059 • Published • 25 -
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs
Paper • 2509.08358 • Published • 13
-
Learning to Reason without External Rewards
Paper • 2505.19590 • Published • 31 -
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Paper • 2502.18581 • Published -
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 94 -
Fractured Chain-of-Thought Reasoning
Paper • 2505.12992 • Published • 23
-
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Paper • 2504.06263 • Published • 186 -
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper • 2510.11341 • Published • 35 -
SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
Paper • 2509.24299 • Published • 1 -
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
Paper • 2511.02778 • Published • 103
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 54 -
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
Paper • 2504.13941 • Published • 12 -
Retrieval-augmented reasoning with lean language models
Paper • 2508.11386 • Published • 5 -
Language Models that Think, Chat Better
Paper • 2509.20357 • Published • 1
-
open-thoughts/OpenThinker3-7B
Text Generation • 8B • Updated • 4.52k • • 135 -
open-thoughts/OpenThoughts3-1.2M
Viewer • Updated • 1.2M • 15.6k • 225 -
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 54 -
open-thoughts/OpenThinker3-1.5B
Text Generation • 2B • Updated • 6.18k • 14
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 54 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 15 -
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 67 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper • 2504.13180 • Published • 20
-
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Paper • 2504.06263 • Published • 186 -
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper • 2510.11341 • Published • 35 -
SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
Paper • 2509.24299 • Published • 1 -
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
Paper • 2511.02778 • Published • 103
-
Large Reasoning Models Learn Better Alignment from Flawed Thinking
Paper • 2510.00938 • Published • 60 -
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Paper • 2509.19284 • Published • 23 -
Learning to Reason as Action Abstractions with Scalable Mid-Training RL
Paper • 2509.25810 • Published • 6 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 277
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 54 -
NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning
Paper • 2504.13941 • Published • 12 -
Retrieval-augmented reasoning with lean language models
Paper • 2508.11386 • Published • 5 -
Language Models that Think, Chat Better
Paper • 2509.20357 • Published • 1
-
Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated
Paper • 2509.05739 • Published • 2 -
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers
Paper • 2509.03059 • Published • 25 -
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs
Paper • 2509.08358 • Published • 13
-
Learning to Reason without External Rewards
Paper • 2505.19590 • Published • 31 -
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
Paper • 2502.18581 • Published -
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 94 -
Fractured Chain-of-Thought Reasoning
Paper • 2505.12992 • Published • 23
-
open-thoughts/OpenThinker3-7B
Text Generation • 8B • Updated • 4.52k • • 135 -
open-thoughts/OpenThoughts3-1.2M
Viewer • Updated • 1.2M • 15.6k • 225 -
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 54 -
open-thoughts/OpenThinker3-1.5B
Text Generation • 2B • Updated • 6.18k • 14