Conformer-CTC fine-tuned on FluencyBank Timestamped

Fine-tuned from nvidia/stt_en_conformer_ctc_small on the FluencyBank Timestamped dataset (arielcerdap/TimeStamped-Splits) for verbatim transcription of stuttered speech from adults who stutter (AWS).
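
A minimal usage sketch, assuming the NeMo toolkit is installed (`pip install nemo_toolkit[asr]`) and the audio is 16 kHz mono WAV, as the base model expects. The helper name `transcribe_files` is hypothetical. It defaults to the base checkpoint, which NeMo can download by name; passing this model's repository ID (arielcerdap/conformer-ctc-small-fluencybank) instead is an assumption about how the fine-tuned checkpoint is hosted and has not been verified here.

```python
from typing import List


def transcribe_files(
    audio_paths: List[str],
    model_name: str = "nvidia/stt_en_conformer_ctc_small",
) -> List[str]:
    """Transcribe audio files with a NeMo ASR checkpoint.

    NeMo is imported lazily so this module can be imported without it.
    """
    import nemo.collections.asr as nemo_asr

    # Download (or load from cache) the checkpoint by name.
    model = nemo_asr.models.ASRModel.from_pretrained(model_name=model_name)
    results = model.transcribe(audio_paths)

    # Depending on the NeMo version, results are plain strings or
    # hypothesis objects with a `.text` attribute; handle both.
    return [getattr(r, "text", r) for r in results]
```

For example, `transcribe_files(["sample.wav"])` would return one transcript per input file, where `sample.wav` stands in for any local recording.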

Results on test set

Metric    Value
WER       18.26%
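
For readers unfamiliar with the metric, word error rate (WER) is the word-level edit distance between the hypothesis and the reference, divided by the number of reference words. The sketch below is a plain-Python illustration, not the evaluation script used for this model; the function names and the example sentences are made up, and it assumes no text normalization beyond whitespace splitting. It also shows why verbatim targets matter: a hypothesis that drops disfluencies is penalized for each one.

```python
from typing import Iterable, List, Tuple


def edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))  # distance from empty reference prefix
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """WER over (reference, hypothesis) pairs: total errors / total reference words."""
    errors = 0
    ref_words = 0
    for reference, hypothesis in pairs:
        ref = reference.split()
        errors += edit_distance(ref, hypothesis.split())
        ref_words += len(ref)
    if ref_words == 0:
        raise ValueError("references contain no words")
    return errors / ref_words


# A verbatim reference with a repetition and a filler; the hypothesis drops both,
# so it incurs 2 deletions over 7 reference words.
example = [("i i want to um go home", "i want to go home")]
print(f"{corpus_wer(example):.4f}")  # 0.2857
```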

Training details

  • Base model: nvidia/stt_en_conformer_ctc_small
  • Dataset: arielcerdap/TimeStamped-Splits (train=2,744 / val=342 / test=342)
  • Target: verbatim transcription (disfluencies preserved)
  • Optimizer: AdamW (lr=1e-5, weight_decay=1e-3)
  • Scheduler: CosineAnnealing (warmup_ratio=0.05)
  • Effective batch size: 64
  • Early stopping: patience=10 on val_wer
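
The scheduler and early-stopping settings listed above can be illustrated in plain Python. This is a sketch under stated assumptions, not the training code: it assumes linear warmup over the first 5% of steps, cosine decay to a minimum learning rate of 0, and that validation WER is checked once per validation run. The names `lr_at_step` and `EarlyStopping`, and the total of 1000 steps used in the example, are hypothetical.

```python
import math


def lr_at_step(step: int, max_steps: int, base_lr: float = 1e-5,
               warmup_ratio: float = 0.05, min_lr: float = 0.0) -> float:
    """Learning rate with linear warmup followed by cosine annealing."""
    warmup_steps = int(warmup_ratio * max_steps)
    if step < warmup_steps:
        # Linear ramp toward base_lr (assumed warmup shape).
        return base_lr * (step + 1) / (warmup_steps + 1)
    # Fraction of the post-warmup schedule completed, clamped to [0, 1].
    progress = min((step - warmup_steps) / max(1, max_steps - warmup_steps), 1.0)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


class EarlyStopping:
    """Signal a stop after `patience` consecutive checks without a lower val_wer."""

    def __init__(self, patience: int = 10) -> None:
        self.patience = patience
        self.best = float("inf")
        self.checks_without_improvement = 0

    def update(self, val_wer: float) -> bool:
        """Record one validation result; return True when training should stop."""
        if val_wer < self.best:
            self.best = val_wer
            self.checks_without_improvement = 0
        else:
            self.checks_without_improvement += 1
        return self.checks_without_improvement >= self.patience


# Learning rate at a few points of a hypothetical 1000-step run.
for step in (0, 50, 525, 1000):
    print(step, lr_at_step(step, max_steps=1000))
```

The effective batch size of 64 is the product of per-device batch size, gradient-accumulation steps, and device count; the card does not state how it was split.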
