Need to read
Textbooks Are All You Need II: phi-1.5 technical report
Paper
• 2309.05463
• Published • 90
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
Paper
• 2309.04564
• Published • 17
Large-Scale Automatic Audiobook Creation
Paper
• 2309.03926
• Published • 55
The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Paper
• 2309.11197
• Published • 5
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper
• 2309.11495
• Published • 40
LMDX: Language Model-based Document Information Extraction and Localization
Paper
• 2309.10952
• Published • 67
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper
• 2309.12307
• Published • 89
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper
• 2309.12284
• Published • 19
Small-scale proxies for large-scale Transformer training instabilities
Paper
• 2309.14322
• Published • 22
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Paper
• 2309.14717
• Published • 46
Jointly Training Large Autoregressive Multimodal Models
Paper
• 2309.15564
• Published • 8