hlttxdy/STAR-1_DeepSeek-R1-Distill-Qwen-7B_dpo_over_refusal_mix_safep0.7_epoch2_lr1e-6_beta0.05_ftx0.2
Text Generation
Transformers
Safetensors
qwen2
llama-factory
full
Generated from Trainer
conversational
text-generation-inference
License: other
Upload all_results.json with huggingface_hub #1
by skyai798 - opened Sep 19, 2025
base: refs/heads/main ← from: refs/pr/1
Files changed: +8 -0
skyai798 (Sep 19, 2025): No description provided.
Upload all_results.json with huggingface_hub (c384310e)
hlttxdy changed pull request status to merged (Sep 19, 2025)