byt5-small-wikipron-eng-latn-multi-broad-p2g

This model is a fine-tuned version of google/byt5-small (the training dataset is not documented in this card; see Training and evaluation data below). It achieves the following results on the evaluation set:

  • Loss: 0.1238
  • Per: 0.2052
  • Gen Len: 8.4891
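The model name suggests a phoneme-to-grapheme (p2g) task: given a broad IPA transcription, the model generates an English spelling. Below is a minimal inference sketch, not taken from this card; the hub repository id and the exact input format are assumptions.

```python
# Minimal inference sketch, assuming the model maps a broad IPA
# transcription to an English spelling (p2g). The repo id below is a
# guess based on this card's title; replace it with the actual hub path.
from transformers import AutoTokenizer, T5ForConditionalGeneration

repo_id = "byt5-small-wikipron-eng-latn-multi-broad-p2g"  # assumed hub id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = T5ForConditionalGeneration.from_pretrained(repo_id)

# ByT5 operates on raw UTF-8 bytes, so the IPA string is fed in directly
# with no pronunciation-specific preprocessing.
ipa = "fəˈnɛtɪks"  # hypothetical example input; the format is an assumption
inputs = tokenizer(ipa, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```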

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0002
  • train_batch_size: 128
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 20.0
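These settings map one-to-one onto transformers' Seq2SeqTrainingArguments. The sketch below reconstructs them; output_dir, the per-epoch evaluation cadence, and predict_with_generate are assumptions rather than values stated in this card.

```python
# Sketch of the training configuration implied by the hyperparameters
# above. Values not listed in the card are marked as assumptions.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="byt5-small-wikipron-eng-latn-multi-broad-p2g",  # assumed
    learning_rate=2e-4,
    per_device_train_batch_size=128,
    per_device_eval_batch_size=32,
    seed=42,
    # Trainer's default AdamW optimizer uses the betas/epsilon listed above.
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_ratio=0.1,
    num_train_epochs=20.0,
    evaluation_strategy="epoch",  # assumed: the results table is per-epoch
    predict_with_generate=True,   # assumed: needed for Per / Gen Len metrics
)
```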

Training results

Training Loss   Epoch   Step    Validation Loss   Per      Gen Len
2.0082          1.0      1177   0.4061            0.6392   8.2917
0.4295          2.0      2354   0.2953            0.5242   8.3425
0.3179          3.0      3531   0.2338            0.4552   8.4024
0.2550          4.0      4708   0.2011            0.4038   8.4287
0.2131          5.0      5885   0.1753            0.3669   8.4356
0.1813          6.0      7062   0.1567            0.3341   8.4336
0.1570          7.0      8239   0.1459            0.3098   8.4546
0.1368          8.0      9416   0.1349            0.2859   8.4531
0.1202          9.0     10593   0.1302            0.2663   8.4621
0.1067         10.0     11770   0.1240            0.2514   8.4701
0.0946         11.0     12947   0.1203            0.2415   8.4734
0.0857         12.0     14124   0.1180            0.2347   8.4782
0.0779         13.0     15301   0.1187            0.2260   8.4827
0.0709         14.0     16478   0.1180            0.2211   8.4781
0.0646         15.0     17655   0.1176            0.2147   8.4856
0.0602         16.0     18832   0.1178            0.2129   8.4858
0.0563         17.0     20009   0.1200            0.2113   8.4844
0.0532         18.0     21186   0.1218            0.2069   8.4907
0.0501         19.0     22363   0.1228            0.2057   8.4891
0.0486         20.0     23540   0.1238            0.2052   8.4891
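As a back-of-the-envelope check (an inference from the table, not a figure stated in the card), 1177 optimizer steps per epoch at a train batch size of 128 puts the training set at roughly 150k examples:

```python
# Rough training-set size implied by the table above: 1177 optimizer
# steps per epoch at train_batch_size=128. This is an upper bound,
# since the last batch of an epoch may be partial.
steps_per_epoch = 1177
train_batch_size = 128
print(steps_per_epoch * train_batch_size)  # 150656, i.e. ~150k examples
```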

Framework versions

  • Transformers 4.28.1
  • Pytorch 2.0.0+cu117
  • Datasets 2.11.1.dev0
  • Tokenizers 0.13.2
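A minimal sketch for checking a local environment against the versions listed above; note that Datasets 2.11.1.dev0 was a development build, and substituting a nearby release (e.g. 2.11.0) is an assumption.

```python
# Compare installed package versions against the ones this card lists.
import datasets
import tokenizers
import torch
import transformers

expected = {
    "transformers": "4.28.1",
    "torch": "2.0.0+cu117",
    "datasets": "2.11.1.dev0",  # dev build; a nearby release is assumed OK
    "tokenizers": "0.13.2",
}
installed = {
    "transformers": transformers.__version__,
    "torch": torch.__version__,
    "datasets": datasets.__version__,
    "tokenizers": tokenizers.__version__,
}
for name, want in expected.items():
    print(f"{name}: installed {installed[name]}, card lists {want}")
```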