Whisper for Turkish Call Centers

This model is a fine-tuned version of openai/whisper-large-v2 on a custom dataset of simulated Turkish call-center conversations. It achieves the following results on the evaluation set (a brief usage sketch follows the results):

  • Loss: 0.0843
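
Below is a minimal transcription sketch for loading this checkpoint with the transformers pipeline. The repository id is this model's own; the audio file name, dtype, and device handling are illustrative assumptions rather than prescriptions.

```python
# Minimal sketch: load the fine-tuned checkpoint for Turkish transcription.
# "call.wav" is a placeholder for your own call-center recording.
import torch
from transformers import pipeline

model_id = "alpcansoydas/whisper-large-v2-tr-ft-25-03-26-encoder-only-25ksamples-original-data"

asr = pipeline(
    "automatic-speech-recognition",
    model=model_id,
    torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32,
    device="cuda:0" if torch.cuda.is_available() else "cpu",
)

# Whisper processes long audio in 30-second chunks; language and task are
# forwarded to model.generate().
result = asr(
    "call.wav",
    chunk_length_s=30,
    generate_kwargs={"language": "turkish", "task": "transcribe"},
)
print(result["text"])
```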

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (see the configuration sketch after this list):

  • learning_rate: 5e-06
  • train_batch_size: 16
  • eval_batch_size: 8
  • seed: 42
  • optimizer: AdamW (adamw_torch_fused) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • training_steps: 4500
  • mixed_precision_training: Native AMP
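
For reference, these settings roughly correspond to a transformers Seq2SeqTrainingArguments configuration like the sketch below. This is not the original training script; output_dir is a placeholder, and the per-device batch-size mapping is an assumption.

```python
# Hedged sketch: mapping the listed hyperparameters onto Seq2SeqTrainingArguments.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="./whisper-large-v2-tr-callcenter",  # assumed path, not from the card
    learning_rate=5e-6,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=8,
    seed=42,
    optim="adamw_torch_fused",      # AdamW (torch fused); betas=(0.9, 0.999), eps=1e-08 are the defaults
    lr_scheduler_type="linear",
    warmup_steps=500,
    max_steps=4500,
    fp16=True,                      # "Native AMP" mixed-precision training
)
```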

Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 0.1319        | 0.1777 | 250  | 0.1265          |
| 0.1046        | 0.3554 | 500  | 0.1069          |
| 0.1004        | 0.5330 | 750  | 0.1001          |
| 0.0973        | 0.7107 | 1000 | 0.0966          |
| 0.0945        | 0.8884 | 1250 | 0.0923          |
| 0.0748        | 1.0661 | 1500 | 0.0897          |
| 0.0680        | 1.2438 | 1750 | 0.0902          |
| 0.0726        | 1.4215 | 2000 | 0.0873          |
| 0.0682        | 1.5991 | 2250 | 0.0848          |
| 0.0617        | 1.7768 | 2500 | 0.0838          |
| 0.0648        | 1.9545 | 2750 | 0.0828          |
| 0.0482        | 2.1322 | 3000 | 0.0850          |
| 0.0473        | 2.3099 | 3250 | 0.0840          |
| 0.0451        | 2.4876 | 3500 | 0.0832          |
| 0.0473        | 2.6652 | 3750 | 0.0835          |
| 0.0459        | 2.8429 | 4000 | 0.0836          |
| 0.0386        | 3.0206 | 4250 | 0.0830          |
| 0.0391        | 3.1983 | 4500 | 0.0843          |

Framework versions

  • Transformers 5.3.0
  • Pytorch 2.10.0+cu128
  • Datasets 4.8.4
  • Tokenizers 0.22.2