EdgeVox Models
Consolidated STT and TTS models for EdgeVox โ sub-second local voice AI for robots and edge devices.
STT Models
| Model | License | Original Source |
|---|---|---|
sherpa-zipformer-vi-30M-int8 |
Apache-2.0 | csukuangfj2/sherpa-onnx-zipformer-vi-30M-int8-2026-02-09 |
Piper Voices
All Piper voices are ONNX models based on the VITS architecture. They run in real-time on CPU.
Legend: Y = freely redistributable, ? = license unclear or has restrictions (see per-voice README)
Other TTS Models (also included in this repo)
| Model | Languages | License | Size |
|---|---|---|---|
| Kokoro-82M | en, en-gb, fr, es, hi, it, pt, ja, zh (25 voices) | Apache 2.0 | ~338 MB |
| Supertonic-2 | ko, en, es, pt, fr (10 voices) | MIT (code) + OpenRAIL-M (weights) | ~255 MB |
| PyThaiTTS | th (1 voice) | Apache 2.0 | ~163 MB |
Usage
from huggingface_hub import hf_hub_download
# Piper voice
model = hf_hub_download("nrl-ai/edgevox-models", "piper/de-thorsten/model.onnx")
config = hf_hub_download("nrl-ai/edgevox-models", "piper/de-thorsten/config.json")
# Kokoro
kokoro_model = hf_hub_download("nrl-ai/edgevox-models", "kokoro/kokoro-v1.0.onnx")
kokoro_voices = hf_hub_download("nrl-ai/edgevox-models", "kokoro/voices-v1.0.bin")
# Supertonic
sup_encoder = hf_hub_download("nrl-ai/edgevox-models", "supertonic/onnx/text_encoder.onnx")
License
This repository contains models under mixed licenses. Each voice subdirectory has its own README with license details. See individual voice directories for specifics.
Models marked as CC0 or CC-BY-4.0 are freely usable. Models with unknown or NC/SA licenses are included for convenience with clear notes โ check the license before commercial use.
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support