interesting papers
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper
• 2502.02737
• Published • 258
A Survey of Context Engineering for Large Language Models
Paper
• 2507.13334
• Published • 263
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper
• 2501.12948
• Published • 447
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
• 2501.08313
• Published • 302
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Paper
• 2501.17161
• Published • 125
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper
• 2501.05441
• Published • 98
Distiller: A Systematic Study of Model Distillation Methods in Natural Language Processing
Paper
• 2109.11105
• Published
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
• 2509.08827
• Published • 193
LongLive: Real-time Interactive Long Video Generation
Paper
• 2509.22622
• Published • 189
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use
Paper
• 2509.24002
• Published • 179
Paper
• 2508.10104
• Published • 305