End of training

Files changed (6)

README.md CHANGED

@@ -4,10 +4,10 @@ library_name: transformers
 model_name: Qwen3-4B-Instruct-2507-SFT-tr5
 tags:
 - generated_from_trainer
-- trl
-- trackio
-- trackio:https://huggingface.co/spaces/hf-imo-colab/trackio-distillation-sft
 - sft
+- trackio:https://huggingface.co/spaces/hf-imo-colab/trackio-distillation-sft
+- trackio
+- trl
 - trl-internal
 licence: license
 ---
@@ -30,7 +30,7 @@ print(output["generated_text"])
 ## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/huggingface/imo-distillation/runs/4uhf3egj)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/huggingface/imo-distillation/runs/menw08rt)
 This model was trained with SFT.

all_results.json CHANGED

@@ -1,9 +1,9 @@
 {
     "epoch": 4.6268656716417915,
     "total_flos": 829937030004736.0,
-    "train_loss": 0.3161669834246559,
-    "train_runtime": 18599.2522,
+    "train_loss": 0.4202386662844689,
+    "train_runtime": 18585.0074,
     "train_samples": 4281,
-    "train_samples_per_second": 1.067,
+    "train_samples_per_second": 1.068,
     "train_steps_per_second": 0.033
 }

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:586bde3de3e0548d8b1e1449a19956bfa34fd86f55775ca4a8e0b94dae8cbf92
+oid sha256:f48cc1c0bb7c443abca2a6c7afef30c5b7e16cb0af9657bb269183c49ca76a76
 size 8044982080

train_results.json CHANGED

@@ -1,9 +1,9 @@
 {
     "epoch": 4.6268656716417915,
     "total_flos": 829937030004736.0,
-    "train_loss": 0.3161669834246559,
-    "train_runtime": 18599.2522,
+    "train_loss": 0.4202386662844689,
+    "train_runtime": 18585.0074,
     "train_samples": 4281,
-    "train_samples_per_second": 1.067,
+    "train_samples_per_second": 1.068,
     "train_steps_per_second": 0.033
 }

trainer_state.json CHANGED

The diff for this file is too large to render.

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e45aaeff312780d39bc02970f00193a16717653200a4bd41605fe0c011a5c30f
+oid sha256:3eec048e628a653c916a3717a9f2d295434c00bf772eb742f14f7f0d21a36376
 size 7633