ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis
Paper β’ 2505.20506 β’ Published β’ 1
Fine-tuning unsloth/orpheus-3b-0.1-ft for Arabic speech synthesis using a 2-stage pipeline:
The training dataset is not included in this repository.
I used licensed/restricted data and cannot redistribute:
This repository only contains the training/inference code, configuration, and demo outputs.
unsloth/orpheus-3b-0.1-ftThis project is intended for research and educational purposes. Please respect the terms of any upstream model and dataset licenses.
@misc{toyin2025arvoicemultispeakerdatasetarabic,
title={ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis},
author={Hawau Olamide Toyin and Rufael Marew and Humaid Alblooshi and Samar M. Magdy and Hanan Aldarmaki},
year={2025},
eprint={2505.20506},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2505.20506},
}