FiveC commited on
Commit
5a96d9e
·
verified ·
1 Parent(s): bd07204

End of training

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [FiveC/BartTay](https://huggingface.co/FiveC/BartTay) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1628
22
- - Sacrebleu: 19.4960
23
 
24
  ## Model description
25
 
@@ -42,7 +42,7 @@ The following hyperparameters were used during training:
42
  - train_batch_size: 32
43
  - eval_batch_size: 32
44
  - seed: 42
45
- - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
  - num_epochs: 3
48
  - mixed_precision_training: Native AMP
@@ -51,14 +51,14 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
53
  |:-------------:|:-----:|:----:|:---------------:|:---------:|
54
- | 0.2227 | 1.0 | 2569 | 0.1660 | 16.3996 |
55
- | 0.1493 | 2.0 | 5138 | 0.1605 | 18.7173 |
56
- | 0.1216 | 3.0 | 7707 | 0.1628 | 19.4960 |
57
 
58
 
59
  ### Framework versions
60
 
61
  - Transformers 4.57.3
62
- - Pytorch 2.9.0+cu126
63
- - Datasets 4.0.0
64
  - Tokenizers 0.22.1
 
18
 
19
  This model is a fine-tuned version of [FiveC/BartTay](https://huggingface.co/FiveC/BartTay) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.1557
22
+ - Sacrebleu: 20.5892
23
 
24
  ## Model description
25
 
 
42
  - train_batch_size: 32
43
  - eval_batch_size: 32
44
  - seed: 42
45
+ - optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
46
  - lr_scheduler_type: linear
47
  - num_epochs: 3
48
  - mixed_precision_training: Native AMP
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
53
  |:-------------:|:-----:|:----:|:---------------:|:---------:|
54
+ | 0.1893 | 1.0 | 3209 | 0.1553 | 15.6992 |
55
+ | 0.1188 | 2.0 | 6418 | 0.1503 | 19.2110 |
56
+ | 0.0966 | 3.0 | 9627 | 0.1557 | 20.5892 |
57
 
58
 
59
  ### Framework versions
60
 
61
  - Transformers 4.57.3
62
+ - Pytorch 2.9.1+cu128
63
+ - Datasets 4.4.2
64
  - Tokenizers 0.22.1