AEmotionStudio commited on
Commit
5c0dd47
·
verified ·
1 Parent(s): ce206ff

Add README.md

Browse files
Files changed (1) hide show
  1. README.md +67 -0
README.md ADDED
@@ -0,0 +1,67 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - audio
5
+ - music
6
+ - source-separation
7
+ - stem-separation
8
+ - roformer
9
+ - safetensors
10
+ - maestraea
11
+ pipeline_tag: audio-to-audio
12
+ ---
13
+
14
+ # RoFormer Stem Separation Models (Safetensors)
15
+
16
+ **BS-RoFormer & MelBand RoFormer — State-of-the-art music source separation**
17
+
18
+ > Pretrained weights converted to safetensors format for use with [Mæstræa AI Workstation](https://github.com/AEmotionStudio/Maestraea).
19
+
20
+ ## Models
21
+
22
+ ### BS-RoFormer (Band-Split RoPE Transformer)
23
+
24
+ | Variant | SDR | Task | Path |
25
+ |---------|-----|------|------|
26
+ | Vocals (viperx) | 12.97 | Vocal/instrumental separation | `bs_roformer/vocals_viperx/` |
27
+ | Multi-stem | 9.65 | 4-stem (bass/drums/vocals/other) | `bs_roformer/multistem/` |
28
+
29
+ ### MelBand RoFormer (Mel-Band RoPE Transformer)
30
+
31
+ | Variant | SDR | Task | Path |
32
+ |---------|-----|------|------|
33
+ | Vocals (KimberleyJensen) | 10.98 | Best vocal isolation | `mel_band_roformer/vocals_kj/` |
34
+ | Vocals (viperx) | 11.43 | Vocal/instrumental separation | `mel_band_roformer/vocals_viperx/` |
35
+ | Dereverb (anvuew) | 19.17 | Remove reverb from audio | `mel_band_roformer/dereverb/` |
36
+ | Denoise (aufr33) | 27.99 | Remove noise from audio | `mel_band_roformer/denoise/` |
37
+
38
+ ## Architecture
39
+
40
+ Both models use the Band-Split RoPE Transformer architecture from [lucidrains/BS-RoFormer](https://github.com/lucidrains/BS-RoFormer):
41
+
42
+ - **BS-RoFormer**: Splits spectrogram into uniform-width subbands
43
+ - **MelBand RoFormer**: Splits using mel-scale (perceptually-weighted) overlapping bands
44
+
45
+ Both significantly outperform HTDemucs on vocal separation tasks.
46
+
47
+ ## Usage
48
+
49
+ Each model directory contains:
50
+ - `model.safetensors` — Model weights
51
+ - `config.yaml` — Architecture configuration (required for model instantiation)
52
+
53
+ Requires `bs-roformer` Python package: `pip install bs-roformer`
54
+
55
+ ## Credits
56
+
57
+ - **Architecture**: [lucidrains/BS-RoFormer](https://github.com/lucidrains/BS-RoFormer)
58
+ - **Training framework**: [ZFTurbo/Music-Source-Separation-Training](https://github.com/ZFTurbo/Music-Source-Separation-Training)
59
+ - **BS-RoFormer vocals**: [viperx](https://github.com/playdasegunda) via [TRvlvr](https://github.com/TRvlvr/model_repo)
60
+ - **MelBand vocals**: [KimberleyJensen](https://github.com/KimberleyJensen), [viperx](https://github.com/playdasegunda)
61
+ - **MelBand dereverb**: [anvuew](https://github.com/anvuew)
62
+ - **MelBand denoise**: [aufr33](https://github.com/aufr33)
63
+ - **Conversion & Mirror by**: [AEmotionStudio](https://huggingface.co/AEmotionStudio)
64
+
65
+ ## License
66
+
67
+ MIT — same as all upstream model releases.