hlttxdy
/

STAR-1_DeepSeek-R1-Distill-Qwen-7B_dpo_over_refusal_mix_safep0.7_epoch2_lr1e-6_beta0.05_ftx0.2

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

Upload all_results.json with huggingface_hub

#1

by skyai798 - opened Sep 19, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

No description provided.

Upload all_results.json with huggingface_hubc384310e

hlttxdy changed pull request status to merged Sep 19, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment