SoulX-Singer Models (Safetensors Mirror)

Safetensors conversion of Soul-AILab/SoulX-Singer weights for use in the MAESTRO AI Workstation.

Models

| Path | Size | Description |
|---|---|---|
| svs/model.safetensors | ~2.82 GB | Singing Voice Synthesis (lyrics + MIDI → singing) |
| svc/model.safetensors | ~2.79 GB | Singing Voice Conversion (audio-to-audio) |
| config.yaml | 579 B | Model architecture configuration |
| phone_set.json | ~30 KB | Phoneme mapping for SVS |
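
To check a downloaded checkpoint before loading several gigabytes of weights, you can read only the safetensors header. The sketch below relies on the published safetensors file layout (an 8-byte little-endian header length followed by a JSON header) and uses only the standard library. The local path `svs/model.safetensors` and the helper names `read_safetensors_header` and `summarize` are illustrative assumptions, not part of this repository.

```python
import json
import struct
from pathlib import Path


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file without reading tensor data."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the header length as a little-endian uint64.
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len).decode("utf-8"))


def summarize(path):
    """Map each tensor name to (dtype, shape), skipping the optional metadata entry."""
    header = read_safetensors_header(path)
    return {
        name: (info["dtype"], tuple(info["shape"]))
        for name, info in header.items()
        if name != "__metadata__"
    }


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever the files were downloaded.
    checkpoint = Path("svs/model.safetensors")
    if checkpoint.exists():
        for name, (dtype, shape) in summarize(checkpoint).items():
            print(name, dtype, shape)
```

The same helper should work for `svc/model.safetensors`, since both files use the same container format.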

Architecture

  • Flow-matching based (F5-TTS foundation)
  • 22-layer transformer with 1024 hidden size, 16 heads
  • 128-dim mel spectrogram, 24 kHz output
  • Trained on 42,000+ hours of aligned vocals (Mandarin, English, Cantonese)
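
For readers unfamiliar with flow matching, the sketch below shows the general sampling idea: start from noise and integrate a velocity field from t = 0 to t = 1 with fixed Euler steps. It is a generic illustration, not SoulX-Singer's actual sampler. The toy velocity function stands in for the trained transformer, and the step count, frame count, and all function names are assumptions; only the 128 mel channels come from the list above.

```python
import random

MEL_DIM = 128  # mel channels, taken from the architecture list


def euler_sample(velocity_fn, x0, num_steps=32):
    """Integrate dx/dt = velocity_fn(x, t) from t=0 to t=1 with fixed Euler steps."""
    x = [row[:] for row in x0]
    dt = 1.0 / num_steps
    for k in range(num_steps):
        t = k * dt
        v = velocity_fn(x, t)
        x = [[xi + dt * vi for xi, vi in zip(xr, vr)] for xr, vr in zip(x, v)]
    return x


def make_toy_velocity(target):
    """Stand-in for a trained model: velocity of a straight-line path to `target`."""
    def velocity(x, t):
        # t never reaches 1.0 inside euler_sample, so the division is safe.
        return [
            [(ti - xi) / (1.0 - t) for xi, ti in zip(xr, tr)]
            for xr, tr in zip(x, target)
        ]
    return velocity


rng = random.Random(0)
num_frames = 4  # arbitrary small length for the demo
noise = [[rng.gauss(0.0, 1.0) for _ in range(MEL_DIM)] for _ in range(num_frames)]
target = [[0.5] * MEL_DIM for _ in range(num_frames)]

# Each row of `mel` is one 128-dim frame; with the toy field it lands on `target`.
mel = euler_sample(make_toy_velocity(target), noise, num_steps=32)
```

In a real pipeline the velocity would come from the 22-layer transformer conditioned on lyrics, MIDI, or source audio, and the resulting mel frames would be passed to a vocoder to produce 24 kHz audio.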

License

Apache 2.0
