Scaling Law for Quantization-Aware Training
Paper
• 2505.14302
• Published • 76
Paper
• 2505.14674
• Published • 37
Paper
• 2505.09388
• Published • 339
AdaptThink: Reasoning Models Can Learn When to Think
Paper
• 2505.13417
• Published • 83
Thinkless: LLM Learns When to Think
Paper
• 2505.13379
• Published • 50
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper
• 2505.03335
• Published • 191
Seed1.5-VL Technical Report
Paper
• 2505.07062
• Published • 157
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable
Speaker Encoder
Paper
• 2505.07916
• Published • 135
Chain-of-Model Learning for Language Model
Paper
• 2505.11820
• Published • 121
Emerging Properties in Unified Multimodal Pretraining
Paper
• 2505.14683
• Published • 134
Parallel Scaling Law for Language Models
Paper
• 2505.10475
• Published • 83
Flow-GRPO: Training Flow Matching Models via Online RL
Paper
• 2505.05470
• Published • 88
RM-R1: Reward Modeling as Reasoning
Paper
• 2505.02387
• Published • 81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
Paper
• 2505.04588
• Published • 65
Scaling Reasoning, Losing Control: Evaluating Instruction Following in
Large Reasoning Models
Paper
• 2505.14810
• Published • 62
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous
Concept Space
Paper
• 2505.15778
• Published • 19
NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop
System from Hypothesis to Verification
Paper
• 2505.16938
• Published • 121
Learning to Reason via Mixture-of-Thought for Logical Reasoning
Paper
• 2505.15817
• Published • 18
One RL to See Them All: Visual Triple Unified Reinforcement Learning
Paper
• 2505.18129
• Published • 62
MemOS: A Memory OS for AI System
Paper
• 2507.03724
• Published • 166
4KAgent: Agentic Any Image to 4K Super-Resolution
Paper
• 2507.07105
• Published • 107
A Survey of Context Engineering for Large Language Models
Paper
• 2507.13334
• Published • 263