Add build_dataset.py: Complete data pipeline (YouTube scraping + HF datasets + synthetic generation)
data/requirements_data.txt
ADDED
@@ -0,0 +1,9 @@
+yt-dlp
+Pillow
+numpy
+scipy
+librosa
+soundfile
+datasets
+huggingface_hub
+imageio-ffmpeg