- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
  Paper • 2501.17161 • Published • 125
- LoRA: Low-Rank Adaptation of Large Language Models
  Paper • 2106.09685 • Published • 60
- Training Compute-Optimal Large Language Models
  Paper • 2203.15556 • Published • 11
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models
  Paper • 2305.10601 • Published • 15
Collections including paper arxiv:2305.10601
- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • Updated • 3.97M • 13.3k
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • Updated • 1.18M • 2.01k
- google/gemma-2-27b-it
  Text Generation • 27B • Updated • 84.2k • 565
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
  Paper • 2201.11903 • Published • 15

- Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks
  Paper • 2412.15605 • Published • 2
- Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
  Paper • 2310.11511 • Published • 80
- Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity
  Paper • 2403.14403 • Published • 7
- Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
  Paper • 2412.13171 • Published • 35

- Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
  Paper • 2310.04406 • Published • 10
- Tree of Thoughts: Deliberate Problem Solving with Large Language Models
  Paper • 2305.10601 • Published • 15
- Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models
  Paper • 2404.02575 • Published • 50
- Voyager: An Open-Ended Embodied Agent with Large Language Models
  Paper • 2305.16291 • Published • 13

- microsoft/ms_marco
  Viewer • Updated • 1.11M • 22.3k • 240
- sentence-transformers/all-nli
  Viewer • Updated • 2.86M • 2.43k • 50
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
  Paper • 2005.11401 • Published • 14
- Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models
  Paper • 2402.04614 • Published • 3

- OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
  Paper • 2404.14619 • Published • 126
- Multi-Head Mixture-of-Experts
  Paper • 2404.15045 • Published • 60
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
  Paper • 2404.14219 • Published • 259
- Learn Your Reference Model for Real Good Alignment
  Paper • 2404.09656 • Published • 90

- Attention Is All You Need
  Paper • 1706.03762 • Published • 120
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 26
- DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
  Paper • 1910.01108 • Published • 22
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 20