OpenMOSS TTSD - MLX

MLX-native int8 conversion of OpenMOSS TTSD for multi-speaker dialogue generation on Apple Silicon.

Variants

Path         Precision
mlx-int8/    int8 quantized weights

How to Get Started

Text must include [S1]/[S2] speaker tags. Omitting them produces degraded output.

python scripts/generate/moss_ttsd.py \
  --text "[S1] Watson, I think we should go. [S2] Give me one moment." \
  --output outputs/dialogue.wav
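
Because untagged text degrades output, it can help to check a script before passing it to --text. The following is a minimal sketch, not part of this repository: the helper name split_turns is hypothetical, and the tag pattern assumes only the [S1] and [S2] forms shown above.

```python
import re

# Assumption: only the [S1] and [S2] tags shown in this README are valid.
SPEAKER_TAG = re.compile(r"\[S([12])\]")


def split_turns(text: str) -> list[tuple[str, str]]:
    """Split a tagged script into (speaker, utterance) pairs.

    Raises ValueError if the text has no speaker tag, or if any text
    appears before the first tag (that text would be untagged).
    """
    matches = list(SPEAKER_TAG.finditer(text))
    if not matches:
        raise ValueError("text must contain at least one [S1]/[S2] tag")
    if text[: matches[0].start()].strip():
        raise ValueError("text before the first speaker tag is untagged")

    turns = []
    for i, match in enumerate(matches):
        # An utterance runs from the end of its tag to the next tag (or EOF).
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        turns.append((f"S{match.group(1)}", text[match.end():end].strip()))
    return turns


if __name__ == "__main__":
    script = "[S1] Watson, I think we should go. [S2] Give me one moment."
    for speaker, utterance in split_turns(script):
        print(speaker, "->", utterance)
```

A script that passes this check is not guaranteed to sound good; the check only catches the missing-tag case described above.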

Supported modes: generation, continuation, voice_clone, voice_clone_and_continuation.

python scripts/generate/moss_ttsd.py \
  --mode voice_clone \
  --text "[S1] This voice was cloned from the reference." \
  --prompt-audio-speaker1 reference.wav \
  --output outputs/clone.wav
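
When driving the script from Python, the argument list can be assembled and checked before anything is launched. This is a hedged sketch: build_command is a hypothetical helper, it uses only the flags that appear in the commands above, and it does not know which flags each mode requires (see --help for that).

```python
import shlex

# The four modes listed in this README.
SUPPORTED_MODES = (
    "generation",
    "continuation",
    "voice_clone",
    "voice_clone_and_continuation",
)


def build_command(text, output, mode=None, prompt_audio_speaker1=None):
    """Build the argv list for scripts/generate/moss_ttsd.py.

    Only flags shown in this README are emitted. Which flags a given
    mode actually requires is not checked here.
    """
    if mode is not None and mode not in SUPPORTED_MODES:
        raise ValueError(f"unsupported mode: {mode!r}")

    cmd = ["python", "scripts/generate/moss_ttsd.py"]
    if mode is not None:
        cmd += ["--mode", mode]
    cmd += ["--text", text]
    if prompt_audio_speaker1 is not None:
        cmd += ["--prompt-audio-speaker1", prompt_audio_speaker1]
    cmd += ["--output", output]
    return cmd


if __name__ == "__main__":
    cmd = build_command(
        text="[S1] This voice was cloned from the reference.",
        output="outputs/clone.wav",
        mode="voice_clone",
        prompt_audio_speaker1="reference.wav",
    )
    # Print a shell-safe rendering; nothing is executed here.
    print(shlex.join(cmd))
```

The returned list can be handed to subprocess.run(cmd, check=True) from the repository root. Passing a list rather than a shell string avoids quoting problems with the brackets and spaces in the text.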

Batch JSONL mode is also supported; see python scripts/generate/moss_ttsd.py --help.

Model Details

Links

License

Apache 2.0, following the upstream license published with OpenMOSS-Team/MOSS-TTSD-v1.0.
