---
license: other
tags:
  - audio
  - tts
  - vae
  - vocoder
  - safetensors
  - dramabox
  - resembleai
base_model: ResembleAI/Dramabox
---

# Dramabox — Audio VAE + Vocoder

This repository contains a **merged safetensors checkpoint** extracted from
[ResembleAI/Dramabox](https://huggingface.co/ResembleAI/Dramabox).

It includes **only the audio-generation weights**:

| Component | Key prefix | Description |
|---|---|---|
| Audio VAE | `audio_vae.*` | Encoder/decoder VAE operating on mel-spectrograms (BF16) |
| Vocoder | `vocoder.vocoder.*` | HiFi-GAN-style neural vocoder (BF16) |
| BWE Generator | `vocoder.bwe_generator.*` | Bandwidth extension (BWE) generator (BF16) |
| Mel STFT | `vocoder.mel_stft.*` | Mel filterbank and STFT forward/inverse basis (BF16) |

All weights are stored in **BFloat16**.
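
The key prefixes in the table can be used to split a loaded checkpoint into one dictionary per component. The following is a minimal sketch, assuming the tensors are already in a plain `dict` keyed by checkpoint name. `COMPONENT_PREFIXES` and `split_by_component` are hypothetical names introduced here for illustration, and stripping the prefix from each key is an assumption about what downstream modules expect, not something the original repository specifies.

```python
from collections import defaultdict

# Prefixes copied from the component table. None of them is a prefix of
# another, so the matching order does not matter.
COMPONENT_PREFIXES = {
    "audio_vae": "audio_vae.",
    "vocoder": "vocoder.vocoder.",
    "bwe_generator": "vocoder.bwe_generator.",
    "mel_stft": "vocoder.mel_stft.",
}


def split_by_component(tensors):
    """Group checkpoint entries by component, stripping the matched prefix.

    Hypothetical helper: keys that match no known prefix are kept unchanged
    under the "unmatched" group so nothing is silently dropped.
    """
    groups = defaultdict(dict)
    for key, value in tensors.items():
        for name, prefix in COMPONENT_PREFIXES.items():
            if key.startswith(prefix):
                groups[name][key[len(prefix):]] = value
                break
        else:
            groups["unmatched"][key] = value
    return dict(groups)
```

Calling `split_by_component(tensors)` on a loaded checkpoint returns a dictionary whose keys are the component names; an `"unmatched"` entry appears only if some keys fall outside the four prefixes listed above.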

## File

| File | Contents |
|---|---|
| `dramabox-audiovae-vocoder.safetensors` | Audio VAE and vocoder weights (vocoder, BWE generator, mel STFT), merged into one file |

## Usage

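```python
from safetensors import safe_open

# Load every tensor onto the CPU as a PyTorch tensor, keyed by its checkpoint name.
tensors = {}
with safe_open("dramabox-audiovae-vocoder.safetensors", framework="pt", device="cpu") as f:
    for key in f.keys():
        tensors[key] = f.get_tensor(key)

# Print the first few keys to confirm the expected prefixes are present.
print(list(tensors.keys())[:5])
```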
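
Once the `tensors` dictionary is populated, it can be useful to confirm the stored precision and, if a downstream operation does not support BFloat16, to work on a float32 copy. This is a rough sketch rather than part of the original repository: `dtype_summary` and `upcast_to_float32` are hypothetical helper names, and the code assumes only that each value exposes a `dtype` attribute and a `float()` method, as PyTorch tensors do.

```python
from collections import Counter


def dtype_summary(tensors):
    """Count tensors per dtype string (hypothetical helper)."""
    return Counter(str(t.dtype) for t in tensors.values())


def upcast_to_float32(tensors):
    """Return a new dict with every tensor converted via .float().

    The input dict is left untouched; for PyTorch tensors, .float()
    produces a float32 tensor.
    """
    return {key: t.float() for key, t in tensors.items()}
```

For this checkpoint, `dtype_summary(tensors)` should report a single dtype if all weights are indeed stored in BFloat16. Note that an upcast copy uses roughly twice the memory of the BF16 original.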

## Source

Extracted from the original
[ResembleAI/Dramabox](https://huggingface.co/ResembleAI/Dramabox) checkpoint.
Please refer to the original repository for licensing details.