Voice Clone Pro ONNX
Files
pocket-tts-onnx/
├── onnx/
│   ├── flow_lm_main.onnx        # 303 MB - Flow LM transformer (FP32)
│   ├── flow_lm_main_int8.onnx   # 76 MB - Flow LM transformer (INT8)
│   ├── flow_lm_flow.onnx        # 39 MB - Flow network (FP32)
│   ├── flow_lm_flow_int8.onnx   # 10 MB - Flow network (INT8)
│   ├── mimi_decoder.onnx        # 42 MB - Audio decoder (FP32)
│   ├── mimi_decoder_int8.onnx   # 23 MB - Audio decoder (INT8)
│   ├── mimi_encoder.onnx        # 73 MB - Voice encoder
│   └── text_conditioner.onnx    # 16 MB - Text embeddings
├── reference_sample.wav         # Example voice reference
├── tokenizer.model              # SentencePiece tokenizer
├── pocket_tts_onnx.py           # Inference wrapper
├── generate.py                  # CLI script
├── requirements.txt             # Python dependencies
└── README.md
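
The listing pairs three of the components with an INT8 variant, while the voice encoder and text conditioner are listed in a single variant only. The sketch below shows one way to resolve the file set for a given precision and to total its download size. The helper names `model_paths` and `total_size_mb` are hypothetical and are not part of `pocket_tts_onnx.py`; the sizes are the approximate figures from the listing, not measured values.

```python
from pathlib import Path

# Components listed with both an FP32 and an INT8 file.
QUANTIZED = ("flow_lm_main", "flow_lm_flow", "mimi_decoder")
# Components listed with a single file only.
SINGLE = ("mimi_encoder", "text_conditioner")

# Approximate sizes in MB, copied from the file listing.
SIZES_MB = {
    "flow_lm_main.onnx": 303, "flow_lm_main_int8.onnx": 76,
    "flow_lm_flow.onnx": 39, "flow_lm_flow_int8.onnx": 10,
    "mimi_decoder.onnx": 42, "mimi_decoder_int8.onnx": 23,
    "mimi_encoder.onnx": 73, "text_conditioner.onnx": 16,
}


def model_paths(root, precision="fp32"):
    """Hypothetical helper: map each component name to its ONNX file path."""
    if precision not in ("fp32", "int8"):
        raise ValueError(f"unknown precision: {precision!r}")
    suffix = "_int8" if precision == "int8" else ""
    onnx_dir = Path(root) / "onnx"
    paths = {name: onnx_dir / f"{name}{suffix}.onnx" for name in QUANTIZED}
    # The encoder and text conditioner have no INT8 file in the listing.
    paths.update({name: onnx_dir / f"{name}.onnx" for name in SINGLE})
    return paths


def total_size_mb(precision="fp32"):
    """Sum the listed sizes of the files needed for one precision."""
    return sum(SIZES_MB[p.name] for p in model_paths(".", precision).values())


if __name__ == "__main__":
    for precision in ("fp32", "int8"):
        print(precision, total_size_mb(precision), "MB")
```

By these figures the FP32 set totals about 473 MB and the INT8 set about 198 MB, with the 73 MB voice encoder and 16 MB text conditioner shared by both.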
Model tree for walydevelopers/voice-clone-pro-onnx
Base model: kyutai/pocket-tts