Collections
Discover the best community collections!
Collections including the paper arXiv:2211.09110 (Holistic Evaluation of Language Models)
- Attention Is All You Need
  Paper • 1706.03762 • Published • 120
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 26
- RoBERTa: A Robustly Optimized BERT Pretraining Approach
  Paper • 1907.11692 • Published • 10
- DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
  Paper • 1910.01108 • Published • 22
- Open LLM Leaderboard
  🏆 14k • Track, rank and evaluate open LLMs and chatbots
- Big Code Models Leaderboard
  📈 1.5k • Explore and submit code model evaluations on a leaderboard
- Arena Leaderboard
  🏆 4.85k • View the LMArena language model leaderboard
- LLM-Perf Leaderboard
  🏆 586 • Explore LLM performance across hardware configurations
- Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
  Paper • 2211.04325 • Published • 1
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 26
- On the Opportunities and Risks of Foundation Models
  Paper • 2108.07258 • Published • 2
- Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
  Paper • 2204.07705 • Published • 2
- Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models
  Paper • 2310.17567 • Published • 1
- This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models
  Paper • 2310.15941 • Published • 6
- Holistic Evaluation of Language Models
  Paper • 2211.09110 • Published • 1
- INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models
  Paper • 2306.04757 • Published • 5