V12X-ksr committed on
Commit 814f526 · verified · 1 Parent(s): c6dc812

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2436
+- Loss: 0.8132
 
 ## Model description
 
@@ -41,25 +41,22 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 8
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.2421 | 1.0 | 474 | 1.2910 |
-| 1.04 | 2.0 | 948 | 1.2588 |
-| 0.991 | 3.0 | 1422 | 1.2436 |
-| 0.5815 | 4.0 | 1896 | 1.5422 |
-| 0.8148 | 5.0 | 2370 | 1.6783 |
-| 0.2292 | 6.0 | 2844 | 2.3901 |
-| 0.0736 | 7.0 | 3318 | 2.5051 |
-| 0.0796 | 8.0 | 3792 | 2.7128 |
+| 0.9926 | 1.0 | 1000 | 0.8635 |
+| 0.6624 | 2.0 | 2000 | 0.8280 |
+| 0.5605 | 3.0 | 3000 | 0.8767 |
+| 0.7377 | 4.0 | 4000 | 0.8132 |
+| 0.7228 | 5.0 | 5000 | 0.8176 |
 
 
 ### Framework versions
 
 - Transformers 4.35.2
-- Pytorch 2.1.0+cu118
-- Datasets 2.15.0
-- Tokenizers 0.15.0
+- Pytorch 2.1.0+cu121
+- Datasets 2.16.1
+- Tokenizers 0.15.1
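
The updated README lists `lr_scheduler_type: linear` with `lr_scheduler_warmup_steps: 500`, and the new results table ends at step 5000. As a rough illustration of what that schedule implies, the sketch below computes the multiplier applied to the base learning rate at a given step. It assumes the total step count is the 5000 shown in the table and that warmup rises linearly from zero; the base learning rate itself is not visible in this diff, so only the multiplier is shown. The function name `linear_warmup_multiplier` is hypothetical, not taken from the training code.

```python
def linear_warmup_multiplier(step: int,
                             warmup_steps: int = 500,
                             total_steps: int = 5000) -> float:
    """Return the factor applied to the base learning rate at `step`.

    Assumption: linear ramp from 0 to 1 over `warmup_steps`, then linear
    decay from 1 to 0 at `total_steps` (5000 is read off the results table).
    """
    if step < warmup_steps:
        # Warmup phase: ramp up linearly from 0.
        return step / max(1, warmup_steps)
    # Decay phase: fall linearly to 0 at the final step.
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    for step in (0, 250, 500, 2750, 5000):
        print(step, linear_warmup_multiplier(step))
```

Under these assumptions the multiplier peaks at step 500, halfway through the first epoch, and reaches zero at the end of epoch 5.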
logs/events.out.tfevents.1706862228.f231d2d3dc2f.1609.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6947259c724b87c98bd93e7d4e90d3a910796c2f539339f5945605a401cd8d02
-size 68135
+oid sha256:0ed3519123d8499012160ca9a4b71d03e50832158dbeb2b8b3f446981f03b6f1
+size 84460
logs/events.out.tfevents.1706865287.f231d2d3dc2f.1609.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ac17c08330adf1c367a86542cdaf889bd9861a69820c41f9889982f3b41e0597
+size 359
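
The two `logs/` entries are Git LFS pointer files, not the TensorBoard event logs themselves: each pointer records only a spec version, a content hash, and a byte size. A minimal sketch of reading those fields is shown below, using the pointer text of the added file; the helper name `parse_lfs_pointer` is hypothetical and the parser handles only the three keys visible in this diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:ac17c08330adf1c367a86542cdaf889bd9861a69820c41f9889982f3b41e0597
size 359
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

Read this way, the changed log grew from 68135 to 84460 bytes, while the newly added log is only 359 bytes.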