-
Textbooks Are All You Need
Paper • 2306.11644 • Published • 154
Self-Improving VLM Judges Without Human Annotations
Paper • 2512.05145 • Published • 20
FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
Paper • 2601.01720 • Published • 6
MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
Paper • 2511.09067 • Published • 2
Collections
Collections including paper arxiv:2306.11644
-
Attention Is All You Need
Paper • 1706.03762 • Published • 121
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 20
LLaMA: Open and Efficient Foundation Language Models
Paper • 2302.13971 • Published • 23
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 251
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 26
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 2
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2
-
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper • 2407.03502 • Published • 51
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 72
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper • 2404.14219 • Published • 259
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Paper • 2402.10379 • Published • 31
-
Textbooks Are All You Need
Paper • 2306.11644 • Published • 154
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 447
Muon is Scalable for LLM Training
Paper • 2502.16982 • Published • 12
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 102