youngzhong's Collections
SOD
Updated about 10 hours ago
SOD (Step-wise On-policy Distillation) model family for small language model agents.
youngzhong/SOD-0.6B • Text Generation • 0.8B • Updated about 10 hours ago
youngzhong/SOD-1.7B • Text Generation • 2B • Updated about 10 hours ago
youngzhong/SOD-GRPO_teacher-4B • Text Generation • 4B • Updated about 10 hours ago