Collections
Discover the best community collections!
Collections including paper arxiv:2101.00027
-
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper • 2211.05100 • Published • 37 -
CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Paper • 2201.11115 • Published -
Training language models to follow instructions with human feedback
Paper • 2203.02155 • Published • 24 -
FinGPT: Large Generative Models for a Small Language
Paper • 2311.05640 • Published • 30
-
SMOTE: Synthetic Minority Over-sampling Technique
Paper • 1106.1813 • Published • 1 -
Scikit-learn: Machine Learning in Python
Paper • 1201.0490 • Published • 1 -
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Paper • 1406.1078 • Published • 1 -
Distributed Representations of Sentences and Documents
Paper • 1405.4053 • Published
-
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Paper • 2401.16380 • Published • 53 -
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
Paper • 2602.05400 • Published • 352 -
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Paper • 2101.00027 • Published • 10
-
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Paper • 2210.01970 • Published • 13 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 123 -
Datasets: A Community Library for Natural Language Processing
Paper • 2109.02846 • Published • 14 -
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Paper • 1910.03771 • Published • 22
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 44 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 15 -
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper • 2104.09864 • Published • 17 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 20
-
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling
Paper • 2401.16380 • Published • 53 -
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
Paper • 2602.05400 • Published • 352 -
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
Paper • 2101.00027 • Published • 10
-
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Paper • 2210.01970 • Published • 13 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 123 -
Datasets: A Community Library for Natural Language Processing
Paper • 2109.02846 • Published • 14 -
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Paper • 1910.03771 • Published • 22
-
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper • 2211.05100 • Published • 37 -
CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Paper • 2201.11115 • Published -
Training language models to follow instructions with human feedback
Paper • 2203.02155 • Published • 24 -
FinGPT: Large Generative Models for a Small Language
Paper • 2311.05640 • Published • 30
-
The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
Paper • 2306.01116 • Published • 44 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 15 -
RoFormer: Enhanced Transformer with Rotary Position Embedding
Paper • 2104.09864 • Published • 17 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 20
-
SMOTE: Synthetic Minority Over-sampling Technique
Paper • 1106.1813 • Published • 1 -
Scikit-learn: Machine Learning in Python
Paper • 1201.0490 • Published • 1 -
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Paper • 1406.1078 • Published • 1 -
Distributed Representations of Sentences and Documents
Paper • 1405.4053 • Published