zuhri025
/

dramabox-audio-vae-vocoder

Model card Files Files and versions

dramabox-audio-vae-vocoder / README.md

zuhri025's picture

Add model card

3c5b5e5 verified 10 days ago

|

history blame contribute delete

1.37 kB

	---
	license: other
	tags:
	- audio
	- tts
	- vae
	- vocoder
	- safetensors
	- dramabox
	- resembleai
	base_model: ResembleAI/Dramabox
	---

	# Dramabox — Audio VAE + Vocoder

	This repository contains a merged safetensors checkpoint extracted from
	[ResembleAI/Dramabox](https://huggingface.co/ResembleAI/Dramabox).

	It includes only the audio-generation weights:

	\| Component \| Keys prefix \| Description \|
	\|---\|---\|---\|
	\| Audio VAE \| `audio_vae.*` \| Encoder / decoder VAE operating on mel-spectrograms (BF16) \|
	\| Vocoder \| `vocoder.vocoder.*` \| HiFi-GAN style neural vocoder (BF16) \|
	\| BWE Generator \| `vocoder.bwe_generator.*` \| Bandwidth extension generator (BF16) \|
	\| Mel STFT \| `vocoder.mel_stft.*` \| Mel filterbank + STFT forward/inverse basis (BF16) \|

	All weights are stored in BFloat16.

	## File

	\| File \| Contents \|
	\|---\|---\|
	\| `dramabox-audiovae-vocoder.safetensors` \| audio_vae + vocoder (merged) \|

	## Usage

	```python
	from safetensors import safe_open

	tensors = {}
	with safe_open("dramabox-audiovae-vocoder.safetensors", framework="pt", device="cpu") as f:
	for key in f.keys():
	tensors[key] = f.get_tensor(key)

	print(list(tensors.keys())[:5])
	```

	## Source

	Extracted from the original
	[ResembleAI/Dramabox](https://huggingface.co/ResembleAI/Dramabox) checkpoint.
	Please refer to the original repository for licensing details.