Commit 835faae (verified), committed by cfei621
1 Parent(s): eaf0f25

Model save

README.md CHANGED
@@ -26,7 +26,7 @@ print(output["generated_text"])
 
 ## Training procedure
 
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/cfei-kaust/huggingface/runs/dkas6kch)
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/cfei-kaust/huggingface/runs/g4fcc6et)
 
 
 This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
all_results.json CHANGED
@@ -1,8 +1,8 @@
 {
     "total_flos": 0.0,
-    "train_loss": 1.366488031092214,
-    "train_runtime": 85280.8428,
-    "train_samples": 8000,
-    "train_samples_per_second": 0.094,
-    "train_steps_per_second": 0.023
+    "train_loss": 2.0727225511886656,
+    "train_runtime": 64632.8349,
+    "train_samples": 5455,
+    "train_samples_per_second": 0.084,
+    "train_steps_per_second": 0.021
 }
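
As a sanity check on the metrics in this hunk, the reported `train_samples_per_second` is consistent with `train_samples / train_runtime` on both sides of the diff. The sketch below is a minimal check under the assumption of a single pass over the data (the epoch count does not appear in this commit); `samples_per_second` is a hypothetical helper, and the numbers are copied from the diff.

```python
# Metric values copied from the all_results.json diff (old and new sides).
old = {"train_runtime": 85280.8428, "train_samples": 8000,
       "train_samples_per_second": 0.094}
new = {"train_runtime": 64632.8349, "train_samples": 5455,
       "train_samples_per_second": 0.084}


def samples_per_second(metrics, epochs=1):
    """Samples processed per wall-clock second, rounded to 3 decimals.

    epochs=1 is an assumption; the diff does not state the epoch count.
    """
    return round(metrics["train_samples"] * epochs / metrics["train_runtime"], 3)


for name, m in (("old", old), ("new", new)):
    # Print the recomputed value next to the reported one.
    print(name, samples_per_second(m), m["train_samples_per_second"])
```

If the recomputed and reported values disagreed, that would suggest more than one epoch or a different sample count than `train_samples`.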
model-00001-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:23dbea6fb8470364f656ec8a3c10856046283811b9d104fa7d53efbd9f06edd0
+oid sha256:aefef14d0fa0e535974078bf6bcb49d7c8df57a6e51ee5d75ca97b8a3d8109e2
 size 4976698672
model-00002-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9bed3343ae10384b135533e32692f2511bba5a613e797f0e562f9e35537b7fd6
+oid sha256:c4e3bd4d81263468dbae202ea58905de4fc00db4f1b0c01b0917b153dfa77e67
 size 4999802720
model-00003-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:309677f767f866c1fda4f289bbb5e0ca769db6e4a5f68b175a71fa7e7f3bb4a4
+oid sha256:0feb6e3fd440cd1b7db08ccf066c480172161987639690dda3ca7680c930f6ef
 size 4915916176
model-00004-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4c4ee711d6c8eb3042f76e33b0e8aed2dbea45efe099114f7e98ee0ad7a164cc
+oid sha256:646ec58c6e3cd775c7ec73fa0c632e2d155fb68dadb8ec308cd29d06f7567d9a
 size 1168138808
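
Each `.safetensors` entry in this commit is a Git LFS pointer (`version`, `oid`, `size`) rather than the weights themselves; only the `oid` changes while every `size` stays the same. The sketch below parses such pointers and sums the shard sizes. `parse_lfs_pointer` is a hypothetical helper written for illustration, and the pointer texts are copied from the new side of the diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict (hypothetical helper)."""
    # Each line is "<key> <value>"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


VERSION = "version https://git-lfs.github.com/spec/v1"

# New-side pointers for the four shards, copied from the diff.
new_pointers = [
    f"{VERSION}\noid sha256:aefef14d0fa0e535974078bf6bcb49d7c8df57a6e51ee5d75ca97b8a3d8109e2\nsize 4976698672\n",
    f"{VERSION}\noid sha256:c4e3bd4d81263468dbae202ea58905de4fc00db4f1b0c01b0917b153dfa77e67\nsize 4999802720\n",
    f"{VERSION}\noid sha256:0feb6e3fd440cd1b7db08ccf066c480172161987639690dda3ca7680c930f6ef\nsize 4915916176\n",
    f"{VERSION}\noid sha256:646ec58c6e3cd775c7ec73fa0c632e2d155fb68dadb8ec308cd29d06f7567d9a\nsize 1168138808\n",
]

shards = [parse_lfs_pointer(p) for p in new_pointers]
total_bytes = sum(s["size"] for s in shards)
print(f"{len(shards)} shards, {total_bytes / 1e9:.2f} GB")
```

Unchanged sizes with changed digests are what one would expect when the same architecture is re-saved with different weight values.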
train_results.json CHANGED
@@ -1,8 +1,8 @@
 {
     "total_flos": 0.0,
-    "train_loss": 1.366488031092214,
-    "train_runtime": 85280.8428,
-    "train_samples": 8000,
-    "train_samples_per_second": 0.094,
-    "train_steps_per_second": 0.023
+    "train_loss": 2.0727225511886656,
+    "train_runtime": 64632.8349,
+    "train_samples": 5455,
+    "train_samples_per_second": 0.084,
+    "train_steps_per_second": 0.021
 }
trainer_state.json CHANGED
The diff for this file is too large to render. See raw diff