Voice Clone Pro ONNX

Files

pocket-tts-onnx/
├── onnx/
│   ├── flow_lm_main.onnx          # 303 MB - Flow LM transformer (FP32)
│   ├── flow_lm_main_int8.onnx     #  76 MB - Flow LM transformer (INT8)
│   ├── flow_lm_flow.onnx          #  39 MB - Flow network (FP32)
│   ├── flow_lm_flow_int8.onnx     #  10 MB - Flow network (INT8)
│   ├── mimi_decoder.onnx          #  42 MB - Audio decoder (FP32)
│   ├── mimi_decoder_int8.onnx     #  23 MB - Audio decoder (INT8)
│   ├── mimi_encoder.onnx          #  73 MB - Voice encoder
│   └── text_conditioner.onnx      #  16 MB - Text embeddings
├── reference_sample.wav           # Example voice reference
├── tokenizer.model                # SentencePiece tokenizer
├── pocket_tts_onnx.py             # Inference wrapper
├── generate.py                    # CLI script
├── requirements.txt               # Python dependencies
└── README.md
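
The listing above shows three components that ship in both FP32 and INT8 form, plus two that ship as a single file. As a minimal sketch of how a caller might pick one consistent set of files, the helper below maps each component to a path for a chosen precision and sums the approximate sizes from the listing. The function names `select_model_files` and `total_size_mb` are hypothetical and are not part of `pocket_tts_onnx.py`; the bundled wrapper may resolve files differently.

```python
from pathlib import Path

# Approximate sizes in MB, copied from the file listing above.
# Components with both an FP32 and an INT8 file:
QUANTIZABLE = {
    "flow_lm_main": {"fp32": 303, "int8": 76},
    "flow_lm_flow": {"fp32": 39, "int8": 10},
    "mimi_decoder": {"fp32": 42, "int8": 23},
}
# Components listed with a single file only:
SINGLE_FILE = {
    "mimi_encoder": 73,
    "text_conditioner": 16,
}


def select_model_files(root, precision="fp32"):
    """Map each component name to its ONNX path for the given precision.

    Hypothetical helper: assumes the directory layout shown in the listing.
    """
    if precision not in ("fp32", "int8"):
        raise ValueError(f"unsupported precision: {precision!r}")
    onnx_dir = Path(root) / "onnx"
    suffix = "_int8" if precision == "int8" else ""
    files = {name: onnx_dir / f"{name}{suffix}.onnx" for name in QUANTIZABLE}
    # Single-file components are used as-is regardless of precision.
    files.update({name: onnx_dir / f"{name}.onnx" for name in SINGLE_FILE})
    return files


def total_size_mb(precision="fp32"):
    """Sum the listed sizes for one full set of models."""
    if precision not in ("fp32", "int8"):
        raise ValueError(f"unsupported precision: {precision!r}")
    quantized = sum(sizes[precision] for sizes in QUANTIZABLE.values())
    return quantized + sum(SINGLE_FILE.values())


if __name__ == "__main__":
    for precision in ("fp32", "int8"):
        files = select_model_files("pocket-tts-onnx", precision)
        print(precision, total_size_mb(precision), "MB")
        for name, path in files.items():
            print(f"  {name}: {path}")
```

By the listed sizes, a full FP32 set comes to roughly 473 MB and a full INT8 set to roughly 198 MB. Each returned path could then be handed to an ONNX runtime of your choice, for example `onnxruntime.InferenceSession(str(path))`, assuming `onnxruntime` is installed.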