Collections
Discover the best community collections!
Collections including paper arxiv:2103.00020
-
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 26 -
Hierarchical Reasoning Model
Paper • 2506.21734 • Published • 50
-
sentence-transformers/all-mpnet-base-v2
Sentence Similarity • 0.1B • Updated • 33M • • 1.28k -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 18 -
google-t5/t5-base
Translation • Updated • 1.4M • • 773 -
Attention Is All You Need
Paper • 1706.03762 • Published • 120
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 245 -
A Survey on Diffusion Language Models
Paper • 2508.10875 • Published • 34 -
High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published • 17 -
Denoising Diffusion Probabilistic Models
Paper • 2006.11239 • Published • 9
-
Transporter Networks: Rearranging the Visual World for Robotic Manipulation
Paper • 2010.14406 • Published -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 21 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 26 -
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 2 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 5 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1
-
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 245 -
A Survey on Diffusion Language Models
Paper • 2508.10875 • Published • 34 -
High-Resolution Image Synthesis with Latent Diffusion Models
Paper • 2112.10752 • Published • 17 -
Denoising Diffusion Probabilistic Models
Paper • 2006.11239 • Published • 9
-
Neural Machine Translation by Jointly Learning to Align and Translate
Paper • 1409.0473 • Published • 7 -
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 26 -
Hierarchical Reasoning Model
Paper • 2506.21734 • Published • 50
-
Transporter Networks: Rearranging the Visual World for Robotic Manipulation
Paper • 2010.14406 • Published -
Learning Transferable Visual Models From Natural Language Supervision
Paper • 2103.00020 • Published • 21 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15
-
Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
Paper • 2211.04325 • Published • 1 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 26 -
On the Opportunities and Risks of Foundation Models
Paper • 2108.07258 • Published • 2 -
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Paper • 2204.07705 • Published • 2
-
sentence-transformers/all-mpnet-base-v2
Sentence Similarity • 0.1B • Updated • 33M • • 1.28k -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 18 -
google-t5/t5-base
Translation • Updated • 1.4M • • 773 -
Attention Is All You Need
Paper • 1706.03762 • Published • 120
-
MIO: A Foundation Model on Multimodal Tokens
Paper • 2409.17692 • Published • 53 -
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Paper • 2010.11929 • Published • 15 -
Going deeper with Image Transformers
Paper • 2103.17239 • Published -
Training data-efficient image transformers & distillation through attention
Paper • 2012.12877 • Published • 2
-
Rich feature hierarchies for accurate object detection and semantic segmentation
Paper • 1311.2524 • Published • 1 -
DeepPose: Human Pose Estimation via Deep Neural Networks
Paper • 1312.4659 • Published • 1 -
Generative Adversarial Networks
Paper • 1406.2661 • Published • 5 -
scikit-image: Image processing in Python
Paper • 1407.6245 • Published • 1