Whisper Podlodka Turbo β€” CoreML (WhisperKit)

CoreML version of bond005/whisper-podlodka-turbo converted for use with WhisperKit on Apple Silicon.

Model Details

  • Architecture: Whisper Turbo (openai/whisper-large-v3-turbo fine-tuned)
  • Format: CoreML .mlmodelc (compiled, ready for Apple Neural Engine)
  • Size: ~1.5 GB total
  • Languages: Russian (primary), English
  • Fine-tuning: Trained on Russian podcast data (Podlodka podcast + Golos dataset)

Model Files

bond005_whisper-podlodka-turbo/
β”œβ”€β”€ AudioEncoder.mlmodelc/     β€” Audio encoder (1.2 GB)
β”œβ”€β”€ TextDecoder.mlmodelc/      β€” Text decoder (328 MB)
β”œβ”€β”€ MelSpectrogram.mlmodelc/   β€” Mel spectrogram preprocessing
β”œβ”€β”€ config.json                β€” WhisperKit configuration
β”œβ”€β”€ generation_config.json     β€” Generation parameters
β”œβ”€β”€ vocab.json, merges.txt     β€” BPE tokenizer
β”œβ”€β”€ tokenizer_config.json
β”œβ”€β”€ normalizer.json
└── *.mlcomputeplan.json       β€” CoreML compute plans

Usage

High-quality Russian ASR model for WhisperKit on Apple Silicon.

// WhisperKit integration
let pipe = try await WhisperKit(
    modelFolder: "/path/to/bond005_whisper-podlodka-turbo",
    download: false
)
let result = try await pipe.transcribe(audioPath: audioURL.path)

Attribution

Base model whisper-large-v3-turbo by OpenAI. Russian fine-tune whisper-podlodka-turbo by @bond005. CoreML conversion for WhisperKit by @smkrv.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for smkrv/whisper-podlodka-turbo-coreml

Finetuned
(3)
this model