To Read
Writing in the Margins: Better Inference Pattern for Long Context Retrieval
Paper
• 2408.14906
• Published • 144
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published • 140
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Paper
• 2409.02795
• Published • 72
Attention Heads of Large Language Models: A Survey
Paper
• 2409.03752
• Published • 92
Building and better understanding vision-language models: insights and future directions
Paper
• 2408.12637
• Published • 133
Transformer Explainer: Interactive Learning of Text-Generative Models
Paper
• 2408.04619
• Published • 175
Gemma 2: Improving Open Language Models at a Practical Size
Paper
• 2408.00118
• Published • 78
Why Does the Effective Context Length of LLMs Fall Short?
Paper
• 2410.18745
• Published • 17
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free
Paper
• 2410.10814
• Published • 51
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation
Paper
• 2410.09584
• Published • 48
Can Knowledge Editing Really Correct Hallucinations?
Paper
• 2410.16251
• Published • 55
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Paper
• 2410.23743
• Published • 64
Scaling Latent Reasoning via Looped Language Models
Paper
• 2510.25741
• Published • 229
DeepAgent: A General Reasoning Agent with Scalable Toolsets
Paper
• 2510.21618
• Published • 103
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper
• 2510.14901
• Published • 48
Continual Learning via Sparse Memory Finetuning
Paper
• 2510.15103
• Published • 3