update readme
Browse files
README.md
CHANGED
|
@@ -17,6 +17,8 @@ tags:
|
|
| 17 |
|
| 18 |
This repository contains the official inference code for εar-VAE, aa 44.1 kHz music signal reconstruction model that rethinks and optimizes VAE training for audio. It targets two common weaknesses in existing open-source VAEs—phase accuracy and stereophonic spatial representation—by aligning objectives with auditory perception and introducing phase-aware training. Experiments show substantial improvements across diverse metrics, with particular strength in high-frequency harmonics and spatial characteristics.
|
| 19 |
|
|
|
|
|
|
|
| 20 |
Why εar-VAE:
|
| 21 |
- 🎧 Perceptual alignment: A K-weighting perceptual filter is applied before loss computation to better match human hearing.
|
| 22 |
- 🔁 Phase-aware objectives: Two novel phase losses
|
|
|
|
| 17 |
|
| 18 |
This repository contains the official inference code for εar-VAE, aa 44.1 kHz music signal reconstruction model that rethinks and optimizes VAE training for audio. It targets two common weaknesses in existing open-source VAEs—phase accuracy and stereophonic spatial representation—by aligning objectives with auditory perception and introducing phase-aware training. Experiments show substantial improvements across diverse metrics, with particular strength in high-frequency harmonics and spatial characteristics.
|
| 19 |
|
| 20 |
+
> ⭐2025-12-10 Update⭐: a new model weight works in 48kHz sample rate, same-level vocal performance with better stereophonic energy reconstruction.
|
| 21 |
+
|
| 22 |
Why εar-VAE:
|
| 23 |
- 🎧 Perceptual alignment: A K-weighting perceptual filter is applied before loss computation to better match human hearing.
|
| 24 |
- 🔁 Phase-aware objectives: Two novel phase losses
|