| --- |
| license: other |
| tags: |
| - audio |
| - tts |
| - vae |
| - vocoder |
| - safetensors |
| - dramabox |
| - resembleai |
| base_model: ResembleAI/Dramabox |
| --- |
| |
| # Dramabox — Audio VAE + Vocoder |
|
|
| This repository contains a **merged safetensors checkpoint** extracted from |
| [ResembleAI/Dramabox](https://huggingface.co/ResembleAI/Dramabox). |
|
|
| It includes **only the audio-generation weights**: |
|
|
| | Component | Keys prefix | Description | |
| |---|---|---| |
| | Audio VAE | `audio_vae.*` | Encoder / decoder VAE operating on mel-spectrograms (BF16) | |
| | Vocoder | `vocoder.vocoder.*` | HiFi-GAN style neural vocoder (BF16) | |
| | BWE Generator | `vocoder.bwe_generator.*` | Bandwidth extension generator (BF16) | |
| | Mel STFT | `vocoder.mel_stft.*` | Mel filterbank + STFT forward/inverse basis (BF16) | |
|
|
| All weights are stored in **BFloat16**. |
|
|
| ## File |
|
|
| | File | Contents | |
| |---|---| |
| | `dramabox-audiovae-vocoder.safetensors` | audio_vae + vocoder (merged) | |
| |
| ## Usage |
| |
| ```python |
| from safetensors import safe_open |
|
|
| tensors = {} |
| with safe_open("dramabox-audiovae-vocoder.safetensors", framework="pt", device="cpu") as f: |
| for key in f.keys(): |
| tensors[key] = f.get_tensor(key) |
|
|
| print(list(tensors.keys())[:5]) |
| ``` |
| |
| ## Source |
| |
| Extracted from the original |
| [ResembleAI/Dramabox](https://huggingface.co/ResembleAI/Dramabox) checkpoint. |
| Please refer to the original repository for licensing details. |
| |