mtolgakbaba committed on
Commit
69f1c53
·
1 Parent(s): 53a2a7b

End of training

README.md CHANGED
@@ -12,6 +12,8 @@ should probably proofread and complete it, then remove this comment. -->
 # mt5-base
 
 This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: nan
 
 ## Model description
 
@@ -30,20 +32,20 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 0.0002
-- train_batch_size: 2
-- eval_batch_size: 2
+- learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 0.01
+- num_epochs: 0.11
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log | 0.01 | 261 | nan |
+| 0.0 | 0.11 | 1431 | nan |
 
 
 ### Framework versions
runs/Dec03_17-04-34_9e16d6131049/events.out.tfevents.1701624376.9e16d6131049.601.0 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:661dd401ccdfb1d99ff0810e658e686c920eaf322b3fb3e94ba7c7b4c908879b
-size 4932
+oid sha256:726d51e600ef5ccafa21e83bfe05657d55f496d3ae2f6205188034ffc223449d
+size 5557
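
The README hunk above sets `lr_scheduler_type: linear` with `learning_rate: 2e-05`. As a rough illustration of what a linear schedule does to the learning rate, here is a minimal sketch. The helper name `linear_lr` is hypothetical and not taken from the training script. The defaults assume that the run ended at step 1431 (the only step logged in the results table) and used no warmup; the commit states neither.

```python
# Hypothetical helper for illustration only; not part of this commit.
# Assumptions: total_steps=1431 (the single logged step) and warmup_steps=0.
def linear_lr(step, base_lr=2e-05, total_steps=1431, warmup_steps=0):
    """Learning rate under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for step in (0, 715, 1431):
        print(step, linear_lr(step))
```

Under these assumptions the rate starts at the configured value and reaches zero at the final step. If the run used warmup or a different step count, the curve changes accordingly.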