data: https://www.kaggle.com/datasets/warkingleo2000/RISK-KD/
kaggle datasets download warkingleo2000/risk-kd
-
phanviethoang1512/llama3.2-1b-deita-dpo-dpo_teacher
8B • Updated • 4 -
phanviethoang1512/llama3.2-1b-deita-dpo-TVKD
1B • Updated • 4 -
phanviethoang1512/llama3.2-1b-deita-dpo-ref_teacher
Text Generation • 8B • Updated • 111 -
phanviethoang1512/llama3.2-1b-deita-dpo-student_sft_init
Text Generation • 1B • Updated • 132