--- license: mit tags: - audio - audio-separation - stem-separation - demucs - htdemucs - safetensors - maestraea pipeline_tag: audio-to-audio --- # HTDemucs Models (Safetensors) **4/6-Stem Source Separation — Vocals, Drums, Bass, Other (+Guitar, Piano)** [Original Source](https://github.com/facebookresearch/demucs) by [Facebook Research](https://github.com/facebookresearch) · MIT License > Converted from the original `.th` checkpoint format to safetensors for faster loading and safer deserialization. For use with [Mæstræa AI Workstation](https://github.com/AEmotionStudio/Maestraea). ## Available Models | File | Stems | Size | Description | |------|-------|------|-------------| | `htdemucs.safetensors` | 4 (drums, bass, other, vocals) | 84 MB | Base model | | `htdemucs_ft.safetensors` | 4 (drums, bass, other, vocals) | 84 MB | **Fine-tuned** — best quality ⭐ | | `htdemucs_6s.safetensors` | 6 (drums, bass, other, vocals, guitar, piano) | 55 MB | 6-stem variant | Each model has a matching `*_config.json` with architecture parameters (sources, sample rate, channels). ## What HTDemucs Does HTDemucs (Hybrid Transformer Demucs) separates mixed audio into individual stems: - **Vocals** — Singing, spoken word - **Drums** — Percussion, kick, snare, hi-hat - **Bass** — Bass guitar, synth bass - **Other** — Everything else (keys, synths, FX) - **Guitar** — (6-stem model only) - **Piano** — (6-stem model only) ### Key Features - Real-time capable on GPU - Adjustable segment size for VRAM control - Best-in-class separation quality (htdemucs_ft) - ~4–6 GB VRAM ## Original Checkpoint URLs These safetensors were converted from: | Model | Original URL | |-------|-------------| | htdemucs | `https://dl.fbaipublicfiles.com/demucs/hybrid_transformer/955717e8-8726e21a.th` | | htdemucs_ft | `https://dl.fbaipublicfiles.com/demucs/hybrid_transformer/04573f0d-f3cf25b2.th` | | htdemucs_6s | `https://dl.fbaipublicfiles.com/demucs/hybrid_transformer/5c90dfd2-34c22ccb.th` | ## Usage with Mæstræa These models are automatically downloaded by the Mæstræa AI Workstation backend. They can also be used directly with the `demucs` library: ```python from demucs.pretrained import get_model model = get_model("htdemucs_ft") ``` ## License MIT — same as the original Demucs release. ## Credits - **Model**: [Facebook Research / Meta AI](https://github.com/facebookresearch/demucs) - **Paper**: [Hybrid Transformers for Music Source Separation](https://arxiv.org/abs/2211.08553) (Rouard et al., 2023) - **Conversion & Mirror by**: [AEmotionStudio](https://huggingface.co/AEmotionStudio)