earlab commited on
Commit
1b5567a
·
verified ·
1 Parent(s): defd3e3

update readme

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -17,6 +17,8 @@ tags:
17
 
18
  This repository contains the official inference code for εar-VAE, aa 44.1 kHz music signal reconstruction model that rethinks and optimizes VAE training for audio. It targets two common weaknesses in existing open-source VAEs—phase accuracy and stereophonic spatial representation—by aligning objectives with auditory perception and introducing phase-aware training. Experiments show substantial improvements across diverse metrics, with particular strength in high-frequency harmonics and spatial characteristics.
19
 
 
 
20
  Why εar-VAE:
21
  - 🎧 Perceptual alignment: A K-weighting perceptual filter is applied before loss computation to better match human hearing.
22
  - 🔁 Phase-aware objectives: Two novel phase losses
 
17
 
18
  This repository contains the official inference code for εar-VAE, aa 44.1 kHz music signal reconstruction model that rethinks and optimizes VAE training for audio. It targets two common weaknesses in existing open-source VAEs—phase accuracy and stereophonic spatial representation—by aligning objectives with auditory perception and introducing phase-aware training. Experiments show substantial improvements across diverse metrics, with particular strength in high-frequency harmonics and spatial characteristics.
19
 
20
+ > ⭐2025-12-10 Update⭐: a new model weight works in 48kHz sample rate, same-level vocal performance with better stereophonic energy reconstruction.
21
+
22
  Why εar-VAE:
23
  - 🎧 Perceptual alignment: A K-weighting perceptual filter is applied before loss computation to better match human hearing.
24
  - 🔁 Phase-aware objectives: Two novel phase losses