december papers
updated
RobustFT: Robust Supervised Fine-tuning for Large Language Models under
Noisy Response
Paper
• 2412.14922
• Published • 88
B-STaR: Monitoring and Balancing Exploration and Exploitation in
Self-Taught Reasoners
Paper
• 2412.17256
• Published • 47
Paper
• 2412.16720
• Published • 37
Revisiting In-Context Learning with Long Context Language Models
Paper
• 2412.16926
• Published • 32
Paper
• 2412.15115
• Published • 377
How to Synthesize Text Data without Model Collapse?
Paper
• 2412.14689
• Published • 53
Thinking in Space: How Multimodal Large Language Models See, Remember,
and Recall Spaces
Paper
• 2412.14171
• Published • 25
Alignment faking in large language models
Paper
• 2412.14093
• Published • 10
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
• 2412.09871
• Published • 108
Apollo: An Exploration of Video Understanding in Large Multimodal Models
Paper
• 2412.10360
• Published • 147
Paper
• 2412.08905
• Published • 123
Evaluating and Aligning CodeLLMs on Human Preference
Paper
• 2412.05210
• Published • 48
Hidden in the Noise: Two-Stage Robust Watermarking for Images
Paper
• 2412.04653
• Published • 30
Training Large Language Models to Reason in a Continuous Latent Space
Paper
• 2412.06769
• Published • 94
Evaluating Language Models as Synthetic Data Generators
Paper
• 2412.03679
• Published • 47
Surveying the Effects of Quality, Diversity, and Complexity in Synthetic
Data From Large Language Models
Paper
• 2412.02980
• Published • 15
Reverse Thinking Makes LLMs Stronger Reasoners
Paper
• 2411.19865
• Published • 23