MioTTS-0.6B Bulgarian Fine-Tuned (BG/EN)

Български

Това е fine-tuned версия на Aratako/MioTTS-0.6B, адаптирана за български TTS.

Fine-tune на LLM частта за български текст → speech tokens.
Обучение върху български dataset (24kHz, двама говорители), с финален run върху филтриран subset.
Запазен е оригиналният tokenizer/архитектура (Qwen3ForCausalLM, vocab 164480).
Няма промени в архитектурата на модела, само в learned weights.
За inference е използван MioCodec pipeline (съвместим с Aratako/MioCodec-25Hz-24kHz).

This is a fine-tuned version of Aratako/MioTTS-0.6B, adapted for Bulgarian TTS.

Fine-tuned the LLM component for Bulgarian text → speech-token generation.
Trained on a Bulgarian dataset (24kHz, two speakers), with the final run on a filtered subset.
Preserved original tokenizer/architecture (Qwen3ForCausalLM, vocab 164480).
No architectural changes; only model weights were updated.
Inference is used with MioCodec pipeline (compatible with Aratako/MioCodec-25Hz-24kHz).

Safetensors

Model size

0.6B params

Tensor type

BF16

Base model

Finetuned

Finetuned

(1)

this model

Quantizations