Sara5115
/

dialect_conversion_model

text2text-generation

Generated from Trainer

Model card Files Files and versions

Sara5115 commited on Jan 31, 2025

Commit

852d275

·

verified ·

1 Parent(s): 13b603e

End of training

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 6.0415
 ## Model description
@@ -42,13 +42,15 @@ The following hyperparameters were used during training:
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 6.9456        | 2.9412 | 100  | 6.0415          |
 ### Framework versions

 This model is a fine-tuned version of [facebook/bart-large](https://huggingface.co/facebook/bart-large) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0474
 ## Model description
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 2.9964        | 2.9412 | 100  | 2.5603          |
+| 0.3588        | 5.8824 | 200  | 0.2224          |
+| 0.0304        | 8.8235 | 300  | 0.0474          |
 ### Framework versions