Collections
Discover the best community collections!
Collections trending this week
-
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
Scaling Laws for Neural Language Models
Paper • 2001.08361 • Published • 10 -
Training Compute-Optimal Large Language Models
Paper • 2203.15556 • Published • 11 -
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
Paper • 2210.04186 • Published
-
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
Viewer • Updated • 306k • 2.55k • 320 -
Alibaba-Apsara/DASD-4B-Thinking
Text Generation • Updated • 458 • 217 -
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob
Viewer • Updated • 435k • 762 • 58 -
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview
Text Generation • Updated • 129 • 52
-
Attention Is All You Need
Paper • 1706.03762 • Published • 120 -
Scaling Laws for Neural Language Models
Paper • 2001.08361 • Published • 10 -
Training Compute-Optimal Large Language Models
Paper • 2203.15556 • Published • 11 -
Analogy Generation by Prompting Large Language Models: A Case Study of InstructGPT
Paper • 2210.04186 • Published
-
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b
Viewer • Updated • 306k • 2.55k • 320 -
Alibaba-Apsara/DASD-4B-Thinking
Text Generation • Updated • 458 • 217 -
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob
Viewer • Updated • 435k • 762 • 58 -
Alibaba-Apsara/DASD-30B-A3B-Thinking-Preview
Text Generation • Updated • 129 • 52