End of training
Browse files- README.md +33 -34
- model.safetensors +1 -1
- training_args.bin +1 -1
README.md
CHANGED
|
@@ -18,8 +18,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [FiveC/BartTay](https://huggingface.co/FiveC/BartTay) on an unknown dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
-
- Loss: 0.
|
| 22 |
-
- Sacrebleu:
|
|
|
|
|
|
|
| 23 |
|
| 24 |
## Model description
|
| 25 |
|
|
@@ -49,38 +51,35 @@ The following hyperparameters were used during training:
|
|
| 49 |
|
| 50 |
### Training results
|
| 51 |
|
| 52 |
-
| Training Loss | Epoch | Step
|
| 53 |
-
|:-------------:|:------:|:----:|:---------------:|:---------:|
|
| 54 |
-
| 0.
|
| 55 |
-
| 0.
|
| 56 |
-
| 0.
|
| 57 |
-
| 0.
|
| 58 |
-
| 0.
|
| 59 |
-
| 0.
|
| 60 |
-
| 0.
|
| 61 |
-
| 0.
|
| 62 |
-
| 0.
|
| 63 |
-
| 0.
|
| 64 |
-
| 0.
|
| 65 |
-
| 0.
|
| 66 |
-
| 0.
|
| 67 |
-
| 0.
|
| 68 |
-
| 0.
|
| 69 |
-
| 0.
|
| 70 |
-
| 0.
|
| 71 |
-
| 0.
|
| 72 |
-
| 0.
|
| 73 |
-
| 0.
|
| 74 |
-
| 0.
|
| 75 |
-
| 0.
|
| 76 |
-
| 0.
|
| 77 |
-
| 0.
|
| 78 |
-
| 0.
|
| 79 |
-
| 0.
|
| 80 |
-
| 0.
|
| 81 |
-
| 0.1271 | 2.7891 | 3584 | 0.1291 | 16.7139 |
|
| 82 |
-
| 0.1196 | 2.8887 | 3712 | 0.1286 | 17.0726 |
|
| 83 |
-
| 0.1244 | 2.9883 | 3840 | 0.1284 | 17.1154 |
|
| 84 |
|
| 85 |
|
| 86 |
### Framework versions
|
|
|
|
| 18 |
|
| 19 |
This model is a fine-tuned version of [FiveC/BartTay](https://huggingface.co/FiveC/BartTay) on an unknown dataset.
|
| 20 |
It achieves the following results on the evaluation set:
|
| 21 |
+
- Loss: 0.1129
|
| 22 |
+
- Sacrebleu: 31.5508
|
| 23 |
+
- Chrf++: 41.2951
|
| 24 |
+
- Bertscore F1: 0.8234
|
| 25 |
|
| 26 |
## Model description
|
| 27 |
|
|
|
|
| 51 |
|
| 52 |
### Training results
|
| 53 |
|
| 54 |
+
| Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Chrf++ | Bertscore F1 |
|
| 55 |
+
|:-------------:|:------:|:-----:|:---------------:|:---------:|:-------:|:------------:|
|
| 56 |
+
| 0.2708 | 0.0999 | 548 | 0.1762 | 6.1958 | 14.3337 | 0.7402 |
|
| 57 |
+
| 0.1967 | 0.1998 | 1096 | 0.1543 | 13.0595 | 22.6490 | 0.7681 |
|
| 58 |
+
| 0.1653 | 0.2997 | 1644 | 0.1433 | 16.4281 | 26.5054 | 0.7790 |
|
| 59 |
+
| 0.148 | 0.3996 | 2192 | 0.1372 | 18.6916 | 29.1575 | 0.7880 |
|
| 60 |
+
| 0.1334 | 0.4995 | 2740 | 0.1309 | 20.7037 | 30.9321 | 0.7929 |
|
| 61 |
+
| 0.1234 | 0.5995 | 3288 | 0.1291 | 21.8427 | 31.8394 | 0.7953 |
|
| 62 |
+
| 0.1153 | 0.6994 | 3836 | 0.1260 | 23.2862 | 33.1552 | 0.7983 |
|
| 63 |
+
| 0.1123 | 0.7993 | 4384 | 0.1231 | 24.3244 | 34.1894 | 0.8022 |
|
| 64 |
+
| 0.1043 | 0.8992 | 4932 | 0.1210 | 25.3951 | 35.1031 | 0.8037 |
|
| 65 |
+
| 0.0982 | 0.9991 | 5480 | 0.1201 | 25.6618 | 35.4972 | 0.8048 |
|
| 66 |
+
| 0.0869 | 1.0990 | 6028 | 0.1193 | 25.8156 | 35.9535 | 0.8083 |
|
| 67 |
+
| 0.0857 | 1.1989 | 6576 | 0.1179 | 26.9340 | 36.8392 | 0.8107 |
|
| 68 |
+
| 0.0815 | 1.2988 | 7124 | 0.1179 | 27.6491 | 37.4053 | 0.8114 |
|
| 69 |
+
| 0.08 | 1.3987 | 7672 | 0.1172 | 28.0729 | 37.7781 | 0.8126 |
|
| 70 |
+
| 0.0781 | 1.4986 | 8220 | 0.1158 | 28.3941 | 38.2541 | 0.8146 |
|
| 71 |
+
| 0.0751 | 1.5985 | 8768 | 0.1145 | 28.9190 | 38.6033 | 0.8150 |
|
| 72 |
+
| 0.0743 | 1.6985 | 9316 | 0.1133 | 29.5192 | 39.0347 | 0.8163 |
|
| 73 |
+
| 0.0712 | 1.7984 | 9864 | 0.1131 | 29.9176 | 39.4411 | 0.8181 |
|
| 74 |
+
| 0.0714 | 1.8983 | 10412 | 0.1122 | 30.1874 | 39.6889 | 0.8190 |
|
| 75 |
+
| 0.069 | 1.9982 | 10960 | 0.1115 | 30.7540 | 40.5206 | 0.8205 |
|
| 76 |
+
| 0.0591 | 2.0981 | 11508 | 0.1148 | 30.3703 | 40.1852 | 0.8208 |
|
| 77 |
+
| 0.059 | 2.1980 | 12056 | 0.1139 | 30.3753 | 40.3092 | 0.8220 |
|
| 78 |
+
| 0.0583 | 2.2979 | 12604 | 0.1140 | 30.8041 | 40.6839 | 0.8216 |
|
| 79 |
+
| 0.058 | 2.3978 | 13152 | 0.1129 | 31.5508 | 41.2951 | 0.8234 |
|
| 80 |
+
| 0.0577 | 2.4977 | 13700 | 0.1126 | 30.9483 | 40.6855 | 0.8231 |
|
| 81 |
+
| 0.0564 | 2.5976 | 14248 | 0.1123 | 30.8206 | 40.7765 | 0.8235 |
|
| 82 |
+
| 0.0571 | 2.6975 | 14796 | 0.1118 | 31.1163 | 41.0993 | 0.8230 |
|
|
|
|
|
|
|
|
|
|
| 83 |
|
| 84 |
|
| 85 |
### Framework versions
|
model.safetensors
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 1583480280
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7213350c61b1820422b1f6b9d92ba75e4005028517418321a946d8efa7bd214f
|
| 3 |
size 1583480280
|
training_args.bin
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
size 5969
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6cf1c4497c22456057fcbaffc461e5c0d1db7dfc2dede9ff6266535af75cc4eb
|
| 3 |
size 5969
|