Whisper Podlodka Turbo - CoreML (WhisperKit)
CoreML version of bond005/whisper-podlodka-turbo converted for use with WhisperKit on Apple Silicon.
Model Details
- Architecture: Whisper Turbo (fine-tuned from openai/whisper-large-v3-turbo)
- Format: CoreML .mlmodelc (compiled, ready for Apple Neural Engine)
- Size: ~1.5 GB total
- Languages: Russian (primary), English
- Fine-tuning: Trained on Russian podcast data (Podlodka podcast + Golos dataset)
Model Files
bond005_whisper-podlodka-turbo/
├── AudioEncoder.mlmodelc/      # Audio encoder (1.2 GB)
├── TextDecoder.mlmodelc/       # Text decoder (328 MB)
├── MelSpectrogram.mlmodelc/    # Mel spectrogram preprocessing
├── config.json                 # WhisperKit configuration
├── generation_config.json      # Generation parameters
├── vocab.json, merges.txt      # BPE tokenizer
├── tokenizer_config.json
├── normalizer.json
└── *.mlcomputeplan.json        # CoreML compute plans
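Before loading the model, it can be useful to confirm that a local copy contains the entries listed above. The following is a minimal sketch using only Foundation; the helper name `missingModelEntries` and the `expectedEntries` list are illustrative assumptions built from the file listing, not part of WhisperKit, and the wildcard `*.mlcomputeplan.json` entries are not checked.

```swift
import Foundation

// Entries named explicitly in the file listing (wildcard entries are skipped).
let expectedEntries = [
    "AudioEncoder.mlmodelc",
    "TextDecoder.mlmodelc",
    "MelSpectrogram.mlmodelc",
    "config.json",
    "generation_config.json",
    "vocab.json",
    "merges.txt",
    "tokenizer_config.json",
    "normalizer.json",
]

/// Returns the expected entries that are absent from `folder`.
/// Hypothetical helper for illustration; not a WhisperKit API.
func missingModelEntries(in folder: URL, expected: [String] = expectedEntries) -> [String] {
    expected.filter { name in
        !FileManager.default.fileExists(atPath: folder.appendingPathComponent(name).path)
    }
}

// The path is a placeholder; replace it with the real model location.
let folder = URL(fileURLWithPath: "/path/to/bond005_whisper-podlodka-turbo")
let missing = missingModelEntries(in: folder)
if missing.isEmpty {
    print("Model folder looks complete")
} else {
    print("Missing entries: \(missing.joined(separator: ", "))")
}
```

A check like this only verifies that the files and directories exist, not that their contents are valid.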
Usage
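Russian ASR model for WhisperKit on Apple Silicon. Point modelFolder at the local model directory and set download to false so that WhisperKit loads the files from disk: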
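import WhisperKit

// WhisperKit integration: load the converted model from a local folder
let pipe = try await WhisperKit(
    modelFolder: "/path/to/bond005_whisper-podlodka-turbo",
    download: false  // use the local files instead of downloading
)
// audioURL is a file URL pointing at the audio to transcribe
let result = try await pipe.transcribe(audioPath: audioURL.path)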
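Because the model is primarily a Russian fine-tune, it may help to set the decoding language explicitly rather than rely on language detection. The sketch below assumes a recent WhisperKit release in which `transcribe(audioPath:decodeOptions:)` returns an array of `TranscriptionResult` and `DecodingOptions` accepts `task` and `language`; older releases return a single optional result instead. The helper names `joinTranscript` and `transcribeRussian` are hypothetical.

```swift
import Foundation

/// Joins per-result texts into one transcript.
/// Hypothetical helper; it does not depend on WhisperKit.
func joinTranscript(_ parts: [String]) -> String {
    parts
        .map { $0.trimmingCharacters(in: .whitespacesAndNewlines) }
        .filter { !$0.isEmpty }
        .joined(separator: " ")
}

#if canImport(WhisperKit)
import WhisperKit

/// Transcribes one audio file with the language set to Russian.
/// Assumes transcribe(audioPath:decodeOptions:) returns [TranscriptionResult].
func transcribeRussian(modelFolder: String, audioPath: String) async throws -> String {
    // Load the converted model from disk, as in the snippet above.
    let pipe = try await WhisperKit(modelFolder: modelFolder, download: false)
    // "ru" is the Whisper language code for Russian.
    let options = DecodingOptions(task: .transcribe, language: "ru")
    let results = try await pipe.transcribe(audioPath: audioPath, decodeOptions: options)
    return joinTranscript(results.map { $0.text })
}
#endif
```

For English audio, the same sketch should apply with `language: "en"`.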
Attribution
Base model whisper-large-v3-turbo by OpenAI. Russian fine-tune whisper-podlodka-turbo by @bond005. CoreML conversion for WhisperKit by @smkrv.