zeliang0426 committed on
Commit 1730207 · verified · 1 Parent(s): ef07b36

Training in progress, step 10

Files changed (3)
  1. README.md +1 -1
  2. adapter_model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -4,8 +4,8 @@ model_name: RM-Cache
 tags:
 - generated_from_trainer
 - trl
-- grpo
 - unsloth
+- grpo
 licence: license
 ---
 
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d169e0ba380278e2939d8ce65f3259c1b62a713e3b224a77b152945e6acafd9f
+oid sha256:14d492c7a40a73ab7d6c5da98208ac654d49626dc66a0cd918e0c8a8e6ea4ad1
 size 22054560
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:22bc783f9f93caf708d1d3aedb77cbacd203e10597e433fef2e08ccafbb73a30
+oid sha256:f9c9866335dc2440f67054f8db3d59fd6b63274325176defbb87f7232131bf89
 size 6929