FiveC commited on
Commit
a1e2f12
·
verified ·
1 Parent(s): 2457ae6

End of training

Browse files
Files changed (3) hide show
  1. README.md +33 -34
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -18,8 +18,10 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [FiveC/BartTay](https://huggingface.co/FiveC/BartTay) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1284
22
- - Sacrebleu: 17.1154
 
 
23
 
24
  ## Model description
25
 
@@ -49,38 +51,35 @@ The following hyperparameters were used during training:
49
 
50
  ### Training results
51
 
52
- | Training Loss | Epoch | Step | Validation Loss | Sacrebleu |
53
- |:-------------:|:------:|:----:|:---------------:|:---------:|
54
- | 0.2531 | 0.0996 | 128 | 0.2094 | 0.1428 |
55
- | 0.2268 | 0.1992 | 256 | 0.1940 | 1.0998 |
56
- | 0.2106 | 0.2988 | 384 | 0.1869 | 2.8366 |
57
- | 0.1997 | 0.3984 | 512 | 0.1815 | 3.3156 |
58
- | 0.2056 | 0.4981 | 640 | 0.1811 | 2.6645 |
59
- | 0.1913 | 0.5977 | 768 | 0.1758 | 4.2371 |
60
- | 0.1852 | 0.6973 | 896 | 0.1677 | 6.2441 |
61
- | 0.187 | 0.7969 | 1024 | 0.1631 | 6.8860 |
62
- | 0.173 | 0.8965 | 1152 | 0.1581 | 7.9818 |
63
- | 0.171 | 0.9961 | 1280 | 0.1569 | 8.3053 |
64
- | 0.1547 | 1.0957 | 1408 | 0.1559 | 9.2269 |
65
- | 0.1546 | 1.1953 | 1536 | 0.1496 | 10.2844 |
66
- | 0.1475 | 1.2949 | 1664 | 0.1478 | 11.3408 |
67
- | 0.1542 | 1.3946 | 1792 | 0.1454 | 11.5532 |
68
- | 0.1532 | 1.4942 | 1920 | 0.1431 | 12.3223 |
69
- | 0.1453 | 1.5938 | 2048 | 0.1410 | 12.8742 |
70
- | 0.1465 | 1.6934 | 2176 | 0.1381 | 13.6623 |
71
- | 0.1486 | 1.7930 | 2304 | 0.1379 | 14.0894 |
72
- | 0.1432 | 1.8926 | 2432 | 0.1353 | 15.0525 |
73
- | 0.1399 | 1.9922 | 2560 | 0.1334 | 14.8205 |
74
- | 0.1325 | 2.0918 | 2688 | 0.1340 | 15.4056 |
75
- | 0.128 | 2.1914 | 2816 | 0.1325 | 16.1499 |
76
- | 0.1238 | 2.2911 | 2944 | 0.1320 | 15.7701 |
77
- | 0.1307 | 2.3907 | 3072 | 0.1302 | 16.3446 |
78
- | 0.126 | 2.4903 | 3200 | 0.1307 | 16.6955 |
79
- | 0.1264 | 2.5899 | 3328 | 0.1296 | 16.8372 |
80
- | 0.12 | 2.6895 | 3456 | 0.1296 | 16.9080 |
81
- | 0.1271 | 2.7891 | 3584 | 0.1291 | 16.7139 |
82
- | 0.1196 | 2.8887 | 3712 | 0.1286 | 17.0726 |
83
- | 0.1244 | 2.9883 | 3840 | 0.1284 | 17.1154 |
84
 
85
 
86
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [FiveC/BartTay](https://huggingface.co/FiveC/BartTay) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.1129
22
+ - Sacrebleu: 31.5508
23
+ - Chrf++: 41.2951
24
+ - Bertscore F1: 0.8234
25
 
26
  ## Model description
27
 
 
51
 
52
  ### Training results
53
 
54
+ | Training Loss | Epoch | Step | Validation Loss | Sacrebleu | Chrf++ | Bertscore F1 |
55
+ |:-------------:|:------:|:-----:|:---------------:|:---------:|:-------:|:------------:|
56
+ | 0.2708 | 0.0999 | 548 | 0.1762 | 6.1958 | 14.3337 | 0.7402 |
57
+ | 0.1967 | 0.1998 | 1096 | 0.1543 | 13.0595 | 22.6490 | 0.7681 |
58
+ | 0.1653 | 0.2997 | 1644 | 0.1433 | 16.4281 | 26.5054 | 0.7790 |
59
+ | 0.148 | 0.3996 | 2192 | 0.1372 | 18.6916 | 29.1575 | 0.7880 |
60
+ | 0.1334 | 0.4995 | 2740 | 0.1309 | 20.7037 | 30.9321 | 0.7929 |
61
+ | 0.1234 | 0.5995 | 3288 | 0.1291 | 21.8427 | 31.8394 | 0.7953 |
62
+ | 0.1153 | 0.6994 | 3836 | 0.1260 | 23.2862 | 33.1552 | 0.7983 |
63
+ | 0.1123 | 0.7993 | 4384 | 0.1231 | 24.3244 | 34.1894 | 0.8022 |
64
+ | 0.1043 | 0.8992 | 4932 | 0.1210 | 25.3951 | 35.1031 | 0.8037 |
65
+ | 0.0982 | 0.9991 | 5480 | 0.1201 | 25.6618 | 35.4972 | 0.8048 |
66
+ | 0.0869 | 1.0990 | 6028 | 0.1193 | 25.8156 | 35.9535 | 0.8083 |
67
+ | 0.0857 | 1.1989 | 6576 | 0.1179 | 26.9340 | 36.8392 | 0.8107 |
68
+ | 0.0815 | 1.2988 | 7124 | 0.1179 | 27.6491 | 37.4053 | 0.8114 |
69
+ | 0.08 | 1.3987 | 7672 | 0.1172 | 28.0729 | 37.7781 | 0.8126 |
70
+ | 0.0781 | 1.4986 | 8220 | 0.1158 | 28.3941 | 38.2541 | 0.8146 |
71
+ | 0.0751 | 1.5985 | 8768 | 0.1145 | 28.9190 | 38.6033 | 0.8150 |
72
+ | 0.0743 | 1.6985 | 9316 | 0.1133 | 29.5192 | 39.0347 | 0.8163 |
73
+ | 0.0712 | 1.7984 | 9864 | 0.1131 | 29.9176 | 39.4411 | 0.8181 |
74
+ | 0.0714 | 1.8983 | 10412 | 0.1122 | 30.1874 | 39.6889 | 0.8190 |
75
+ | 0.069 | 1.9982 | 10960 | 0.1115 | 30.7540 | 40.5206 | 0.8205 |
76
+ | 0.0591 | 2.0981 | 11508 | 0.1148 | 30.3703 | 40.1852 | 0.8208 |
77
+ | 0.059 | 2.1980 | 12056 | 0.1139 | 30.3753 | 40.3092 | 0.8220 |
78
+ | 0.0583 | 2.2979 | 12604 | 0.1140 | 30.8041 | 40.6839 | 0.8216 |
79
+ | 0.058 | 2.3978 | 13152 | 0.1129 | 31.5508 | 41.2951 | 0.8234 |
80
+ | 0.0577 | 2.4977 | 13700 | 0.1126 | 30.9483 | 40.6855 | 0.8231 |
81
+ | 0.0564 | 2.5976 | 14248 | 0.1123 | 30.8206 | 40.7765 | 0.8235 |
82
+ | 0.0571 | 2.6975 | 14796 | 0.1118 | 31.1163 | 41.0993 | 0.8230 |
 
 
 
83
 
84
 
85
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fd35ce75ebc0ea976ab42326abe895822a80726f78cfc763e2bdc0ed1ba0e827
3
  size 1583480280
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7213350c61b1820422b1f6b9d92ba75e4005028517418321a946d8efa7bd214f
3
  size 1583480280
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7680a1d34a65b8a6efa920d2bcc5a95253e4d6b194d5e275a3f1a4be0ef0b969
3
  size 5969
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6cf1c4497c22456057fcbaffc461e5c0d1db7dfc2dede9ff6266535af75cc4eb
3
  size 5969