Whisper for Turkish Call Centers

This model is a fine-tuned version of openai/whisper-large-v2 on a custom dataset of simulated Turkish call-center conversations. It achieves the following results on the evaluation set (a brief usage sketch follows the results):

  • Loss: 0.0843
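
Below is a minimal transcription sketch for loading this checkpoint with the transformers pipeline. The repository id is this model's own; the audio file name, dtype, and device handling are illustrative assumptions rather than prescriptions.

```python
# Minimal sketch: load the fine-tuned checkpoint for Turkish transcription.
# "call.wav" is a placeholder for your own call-center recording.
import torch
from transformers import pipeline

model_id = "alpcansoydas/whisper-large-v2-tr-ft-25-03-26-encoder-only-25ksamples-original-data"

asr = pipeline(
    "automatic-speech-recognition",
    model=model_id,
    torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32,
    device="cuda:0" if torch.cuda.is_available() else "cpu",
)

# Whisper processes long audio in 30-second chunks; language and task are
# forwarded to model.generate().
result = asr(
    "call.wav",
    chunk_length_s=30,
    generate_kwargs={"language": "turkish", "task": "transcribe"},
)
print(result["text"])
```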

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training (see the configuration sketch after this list):

  • learning_rate: 5e-06
  • train_batch_size: 16
  • eval_batch_size: 8
  • seed: 42
  • optimizer: AdamW (adamw_torch_fused) with betas=(0.9, 0.999) and epsilon=1e-08; no additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 500
  • training_steps: 4500
  • mixed_precision_training: Native AMP
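
For reference, these settings roughly correspond to a transformers Seq2SeqTrainingArguments configuration like the sketch below. This is not the original training script; output_dir is a placeholder, and the per-device batch-size mapping is an assumption.

```python
# Hedged sketch: mapping the listed hyperparameters onto Seq2SeqTrainingArguments.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="./whisper-large-v2-tr-callcenter",  # assumed path, not from the card
    learning_rate=5e-6,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=8,
    seed=42,
    optim="adamw_torch_fused",      # AdamW (torch fused); betas=(0.9, 0.999), eps=1e-08 are the defaults
    lr_scheduler_type="linear",
    warmup_steps=500,
    max_steps=4500,
    fp16=True,                      # "Native AMP" mixed-precision training
)
```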

Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 0.1319        | 0.1777 | 250  | 0.1265          |
| 0.1046        | 0.3554 | 500  | 0.1069          |
| 0.1004        | 0.5330 | 750  | 0.1001          |
| 0.0973        | 0.7107 | 1000 | 0.0966          |
| 0.0945        | 0.8884 | 1250 | 0.0923          |
| 0.0748        | 1.0661 | 1500 | 0.0897          |
| 0.0680        | 1.2438 | 1750 | 0.0902          |
| 0.0726        | 1.4215 | 2000 | 0.0873          |
| 0.0682        | 1.5991 | 2250 | 0.0848          |
| 0.0617        | 1.7768 | 2500 | 0.0838          |
| 0.0648        | 1.9545 | 2750 | 0.0828          |
| 0.0482        | 2.1322 | 3000 | 0.0850          |
| 0.0473        | 2.3099 | 3250 | 0.0840          |
| 0.0451        | 2.4876 | 3500 | 0.0832          |
| 0.0473        | 2.6652 | 3750 | 0.0835          |
| 0.0459        | 2.8429 | 4000 | 0.0836          |
| 0.0386        | 3.0206 | 4250 | 0.0830          |
| 0.0391        | 3.1983 | 4500 | 0.0843          |

Framework versions

  • Transformers 5.3.0
  • Pytorch 2.10.0+cu128
  • Datasets 4.8.4
  • Tokenizers 0.22.2