wh-stage1ft-lr1e7-dtstf5-adm-ga1ba16-st15k-v2-evalstp50-pat20-trainvalch

This model is a fine-tuned version of HouraMor/wh-ft-lr5e6-dtstf5-adm-ga1ba16-st15k-v2-evalstp500-pat5 on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 0.5604
Wer: 0.2804
Cer: 0.2126

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-07
train_batch_size: 16
eval_batch_size: 8
seed: 42
optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 25
training_steps: 5000
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Wer	Cer
0.3641	0.0201	10	0.5640	0.3219	0.2368
0.3048	0.0402	20	0.5639	0.3213	0.2363
0.2037	0.0602	30	0.5636	0.3207	0.2359
0.3871	0.0803	40	0.5631	0.2986	0.2261
0.2966	0.1004	50	0.5628	0.2984	0.2259
0.2678	0.1205	60	0.5625	0.2986	0.2260
0.2354	0.1406	70	0.5623	0.2982	0.2254
0.2986	0.1606	80	0.5620	0.2979	0.2254
0.2941	0.1807	90	0.5617	0.2991	0.2270
0.3354	0.2008	100	0.5615	0.2893	0.2183
0.278	0.2209	110	0.5613	0.2893	0.2182
0.2274	0.2410	120	0.5611	0.2892	0.2182
0.2784	0.2610	130	0.5611	0.2888	0.2180
0.2479	0.2811	140	0.5610	0.2885	0.2177
0.3538	0.3012	150	0.5609	0.2891	0.2181
0.3683	0.3213	160	0.5608	0.2881	0.2174
0.2722	0.3414	170	0.5607	0.2886	0.2177
0.3154	0.3614	180	0.5605	0.2882	0.2178
0.1633	0.3815	190	0.5604	0.2881	0.2174
0.2839	0.4016	200	0.5603	0.2875	0.2171
0.2588	0.4217	210	0.5603	0.2872	0.2166
0.2303	0.4418	220	0.5602	0.2864	0.2161
0.251	0.4618	230	0.5601	0.2860	0.2160
0.1899	0.4819	240	0.5601	0.2865	0.2165
0.3008	0.5020	250	0.5601	0.2851	0.2155
0.3326	0.5221	260	0.5600	0.2851	0.2156
0.3676	0.5422	270	0.5599	0.2846	0.2152
0.275	0.5622	280	0.5598	0.2849	0.2153
0.2401	0.5823	290	0.5598	0.2853	0.2155
0.2577	0.6024	300	0.5598	0.2840	0.2146
0.2983	0.6225	310	0.5599	0.2841	0.2147
0.3534	0.6426	320	0.5599	0.2834	0.2144
0.1726	0.6627	330	0.5598	0.2834	0.2145
0.2811	0.6827	340	0.5598	0.2824	0.2137
0.3336	0.7028	350	0.5597	0.2824	0.2140
0.3336	0.7229	360	0.5597	0.2827	0.2143
0.2073	0.7430	370	0.5597	0.2826	0.2142
0.1993	0.7631	380	0.5597	0.2829	0.2145
0.2417	0.7831	390	0.5598	0.2820	0.2139
0.2221	0.8032	400	0.5599	0.2825	0.2145
0.2726	0.8233	410	0.5600	0.2832	0.2149
0.2351	0.8434	420	0.5600	0.2835	0.2150
0.2651	0.8635	430	0.5600	0.2830	0.2147
0.3063	0.8835	440	0.5601	0.2831	0.2148
0.2651	0.9036	450	0.5601	0.2824	0.2145
0.2788	0.9237	460	0.5601	0.2825	0.2142
0.2582	0.9438	470	0.5602	0.2827	0.2140
0.3131	0.9639	480	0.5603	0.2818	0.2134
0.3044	0.9839	490	0.5603	0.2814	0.2132
0.4349	1.0040	500	0.5602	0.2819	0.2135
0.302	1.0241	510	0.5601	0.2819	0.2134
0.3211	1.0442	520	0.5601	0.2815	0.2131
0.4045	1.0643	530	0.5601	0.2811	0.2131
0.3015	1.0843	540	0.5601	0.2814	0.2133
0.1915	1.1044	550	0.5602	0.2811	0.2131
0.284	1.1245	560	0.5603	0.2811	0.2132
0.2912	1.1446	570	0.5604	0.2804	0.2126

Framework versions

Transformers 4.55.2
Pytorch 2.7.0+cu118
Datasets 2.21.0
Tokenizers 0.21.4

Downloads last month: 1

Safetensors

Model size

2B params

Tensor type

F32

Model tree for HouraMor/wh-stage1ft-lr1e7-dtstf5-adm-ga1ba16-st15k-v2-evalstp50-pat20-trainvalch

Base model

openai/whisper-large-v3

Finetuned

HouraMor/wh-ft-lr5e6-dtstf5-adm-ga1ba16-st15k-v2-evalstp500-pat5

Finetuned

(8)

this model