China models
Text Generation
• Updated • 5.2k
• 32
internlm/internlm2-chat-1_8b
Text Generation
• 2B • Updated • 5.27k
• 35
Text Generation
• Updated • 24.5k
• 43
internlm/internlm2-chat-7b
Text Generation
• Updated • 47.3k
• 83
internlm/internlm2-base-20b
Text Generation
• Updated • 16.8k
• 8
Text Generation
• Updated • 20k
• 59
internlm/internlm2-chat-20b
Text Generation
• 20B • Updated • 19.6k
• 88
YeungNLP/firefly-pretrain-dataset
• Updated • 2.46M • 287
• 42
9B • Updated • 363k
• 705
14B • Updated • 64.3k
• 268
9B • Updated • 8.78k
• 201
Text Generation
• 9B • Updated • 9.05k
• 145
Text Generation
• 685B • Updated • 3.98M
• 13.3k
Text Generation
• 8B • Updated • 24
• 32
Text Generation
• 73B • Updated • 22
• 32
Image-Text-to-Text
• 1B • Updated • 41k
• 96
Image-Text-to-Text
• Updated • 266
• 60
Image-Text-to-Text
• 5B • Updated • 74.4k
• 61
Image-Text-to-Text
• 9B • Updated • 1.23k
• 75
Image-Text-to-Text
• 16B • Updated • 166
• 98
Image-Text-to-Text
• 35B • Updated • 341
• 142
deepseek-ai/DeepSeek-R1-Zero
Text Generation
• Updated • 6.57k
• 950
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
Text Generation
• 33B • Updated • 1.02M
• 1.54k
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation
• Updated • 150k
• 767
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation
• Updated • 1.99M
• 855
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
• 8B • Updated • 620k
• 808
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
• 2B • Updated • 673k
• 1.49k
Text Generation
• 685B • Updated • 836k
• 4.06k
deepseek-ai/DeepSeek-V3-Base
Updated • 17.1k
• 1.69k
deepseek-ai/deepseek-math-7b-instruct
Text Generation
• Updated • 9.56k
• 150
deepseek-ai/deepseek-math-7b-base
Text Generation
• Updated • 7.61k
• 87
deepseek-ai/deepseek-math-7b-rl
Text Generation
• 7B • Updated • 3.5k
• 92
Text Generation
• 33B • Updated • 70.5k
• 2.89k
Qwen/Qwen2.5-14B-Instruct-1M
Text Generation
• 15B • Updated • 3.47k
• 332
Qwen/Qwen2.5-7B-Instruct-1M
Text Generation
• 8B • Updated • 106k
• 363
qihoo360/TinyR1-32B-Preview
Text Generation
• 33B • Updated • 170
• 324
Text Generation
• Updated • 2.36M
• 683
Text Generation
• Updated • 2.93M
• 386
Text Generation
• 8B • Updated • 8.64M
• 1.05k
Text Generation
• Updated • 7.56M
• 600
Text Generation
• 2B • Updated • 7.33M
• 450
Text Generation
• 0.8B • Updated • 15.6M
• 1.2k