Qwen3-ASR Arabic โ€” KSA Saudi Dialect

Fine-tuned Qwen/Qwen3-ASR-1.7B for KSA Saudi Arabic dialect speech recognition.

Sequential fine-tuning: Base Qwen3-ASR -> UAE model -> this KSA model.

Results

Metric Zero-shot (base) Fine-tuned Improvement
WER 14.41% 11.49% -20%
CER 6.57% 5.78% -12%

Evaluated on 849 KSA Arabic validation samples.

What improved

  • Better handling of KSA dialect expressions and vocabulary
  • Removes spurious punctuation that the base model adds
  • Matches informal Saudi dialect spelling conventions

Training Details

  • Base model: vadimbelsky/qwen3-asr-arabic-uae (itself fine-tuned from Qwen3-ASR-1.7B on UAE data)
  • Training data: ~7,350 KSA Saudi Arabic dialect samples from vadimbelsky/KSA_Arabic_English_Dataset_13k
  • Strategy: Audio encoder frozen, only LLM decoder fine-tuned (84.4% of params)
  • Precision: bfloat16
  • Epochs: 3
  • Effective batch size: 32 (batch 2 x gradient accumulation 16)
  • Learning rate: 1e-5 with linear schedule (lower than UAE stage)
  • Gradient checkpointing: enabled
  • Text normalization: Diacritics removed, alef/teh marbuta normalized, punctuation stripped

Usage

from qwen_asr import Qwen3ASRModel

model = Qwen3ASRModel.from_pretrained("vadimbelsky/qwen3-asr-arabic-ksa")
result = model.transcribe("audio.wav", language="Arabic")
print(result)

Or with transformers directly:

from transformers import AutoModelForCausalLM, AutoProcessor

model = AutoModelForCausalLM.from_pretrained("vadimbelsky/qwen3-asr-arabic-ksa")
processor = AutoProcessor.from_pretrained("vadimbelsky/qwen3-asr-arabic-ksa")

Limitations

  • Trained on synthetic/generated Arabic speech data
  • Optimized for KSA Saudi dialect โ€” may not generalize to other Arabic dialects
  • Numbers are sometimes transcribed differently (e.g. digits vs spelled out)
  • Short utterances only (training data mostly < 20s)

Related Models

License

Apache 2.0 (same as base model)

Downloads last month
17
Safetensors
Model size
2B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vadimbelsky/qwen3-asr-arabic-ksa

Finetuned
(37)
this model

Space using vadimbelsky/qwen3-asr-arabic-ksa 1

Evaluation results