Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ou474747
/
DeepSeek-R1-Distill-Qwen-1.5B-sft-cot-lr3e-5_sched-linear_ep3_bs8_gs4_multi
like
0
TensorBoard
Safetensors
qwen2
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
No model card
Downloads last month
1
Safetensors
Model size
2B params
Tensor type
BF16
·
Chat template
Files info
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
🙋
Ask for provider support