- A Benchmark for Learning to Translate a New Language from One Grammar Book
  Paper • 2309.16575 • Published • 1
- Can LLMs Really Learn to Translate a Low-Resource Language from One Grammar Book?
  Paper • 2409.19151 • Published
- TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
  Paper • 2305.07759 • Published • 45
- TinyLlama: An Open-Source Small Language Model
  Paper • 2401.02385 • Published • 95
Collections
Collections including paper arxiv:2305.07759
- vincentkoc/tiny_qa_benchmark
  Viewer • Updated • 52 • 41 • 1
- vincentkoc/tiny_qa_benchmark_pp
  Viewer • Updated • 662 • 383 • 2
- Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation
  Paper • 2505.12058 • Published • 6
- roneneldan/TinyStories
  Viewer • Updated • 2.14M • 97.5k • 956

- AgentInstruct: Toward Generative Teaching with Agentic Flows
  Paper • 2407.03502 • Published • 51
- Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
  Paper • 2406.08464 • Published • 72
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
  Paper • 2404.14219 • Published • 259
- DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
  Paper • 2402.10379 • Published • 31

- Textbooks Are All You Need
  Paper • 2306.11644 • Published • 154
- Textbooks Are All You Need II: phi-1.5 technical report
  Paper • 2309.05463 • Published • 90
- TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
  Paper • 2305.07759 • Published • 45
- Scaling Synthetic Data Creation with 1,000,000,000 Personas
  Paper • 2406.20094 • Published • 107