Jaavid25 commited on
Commit
be52208
·
verified ·
1 Parent(s): 7f39b61

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +70 -3
README.md CHANGED
@@ -1,3 +1,70 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - music
5
+ - audio
6
+ - popularity-prediction
7
+ - aesthetic-quality
8
+ - multi-task-learning
9
+ - mert
10
+ - ai-generated-music
11
+ - suno
12
+ - udio
13
+ language:
14
+ - en
15
+ library_name: transformers
16
+ ---
17
+
18
+ # APEX: Large-Scale Multi-Task Aesthetic-Informed Popularity Prediction for AI-Generated Music
19
+
20
+ APEX is the first large-scale multi-task learning framework for jointly predicting **popularity** and **aesthetic quality** of AI-generated music from audio alone. It is trained on over 211k AI-generated songs (~10k hours of audio) from Suno and Udio, leveraging [MERT-v1-95M](https://huggingface.co/m-a-p/MERT-v1-95M) audio embeddings.
21
+
22
+ ---
23
+
24
+ ## What does APEX predict?
25
+
26
+ Given any audio file, APEX predicts 7 scores:
27
+
28
+ **Popularity:**
29
+ | Score | Range | Description |
30
+ |---|---|---|
31
+ | `score_streams` | 0–100 | Predicted streaming engagement score |
32
+ | `score_likes` | 0–100 | Predicted likes engagement score |
33
+
34
+ **Aesthetic Quality (from [SongEval](https://github.com/ASLP-lab/SongEval)):**
35
+ | Score | Range | Description |
36
+ |---|---|---|
37
+ | `coherence` | 1–5 | Structural and harmonic coherence |
38
+ | `musicality` | 1–5 | Overall musical quality |
39
+ | `memorability` | 1–5 | How memorable the song is |
40
+ | `clarity` | 1–5 | Clarity of production and mix |
41
+ | `naturalness` | 1–5 | Naturalness of the generated audio |
42
+
43
+ ---
44
+
45
+ ## Architecture
46
+
47
+ ![APEX Architecture](architecture.png)
48
+
49
+ ---
50
+
51
+ ## Usage
52
+
53
+ ### Installation
54
+
55
+ ```bash
56
+ pip install torch transformers soundfile torchaudio
57
+ ```
58
+
59
+ ### Inference
60
+
61
+ ```python
62
+ from transformers import AutoModel
63
+
64
+ model = AutoModel.from_pretrained("amaai-lab/apex", trust_remote_code=True)
65
+ results = model.predict("my_song.mp3", save_json="results.json")
66
+
67
+ print(results["score_streams"]) # popularity score 0-100
68
+ print(results["score_likes"]) # popularity score 0-100
69
+ print(results["coherence"]) # aesthetic score 1-5
70
+ ```