Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
PARTAGES-dev
's Collections
Encoder pretraining from scratch (commercial use)
Encoder continual pretraining (research use)
Qwen3+PDAPT+SLERP
Qwen3+PDAPT+SLERP
updated
Mar 10
Experiments conducted for the LREC paper ()
Upvote
-
PARTAGES-dev/Qwen3-8B-PDAPT-SLERP
Text Generation
•
8B
•
Updated
10 days ago
•
436
PARTAGES-dev/Qwen3-4B-PDAPT-SLERP
Text Generation
•
4B
•
Updated
Dec 3, 2025
•
259
Qwen/Qwen3-8B-Base
Text Generation
•
8B
•
Updated
May 21, 2025
•
495k
•
•
97
Qwen/Qwen3-4B-Base
Text Generation
•
4B
•
Updated
Jul 26, 2025
•
1.49M
•
84
Qwen/Qwen3-1.7B-Base
Text Generation
•
2B
•
Updated
Jul 26, 2025
•
328k
•
70
Qwen/Qwen3-0.6B-Base
Text Generation
•
Updated
Jul 26, 2025
•
270k
•
160
PARTAGES-dev/Qwen3-1.7B-PDAPT-SLERP
Text Generation
•
2B
•
Updated
Feb 25
•
181
PARTAGES-dev/Qwen3-0.6B-PDAPT-SLERP
Text Generation
•
0.8B
•
Updated
Dec 4, 2025
•
114
Upvote
-
Share collection
View history
Collection guide
Browse collections