dan su's picture

dan su

sudanenator

·

AI & ML interests

None yet

Recent Activity

new activity 6 days ago

SynDataLab/omnivoice-zh:May I ask whether this data is real or synthetic?

reacted to nyuuzyou's post with 🔥 about 1 year ago

🇷🇺 Russian Forum Messages Dataset - https://huggingface.co/datasets/nyuuzyou/ruforum Collection of approximately 58 million Russian forum messages featuring: - Complete message content from Russian online forums spanning 2010-2025 - Comprehensive metadata including unique message IDs and timestamps - Full text content preserving original user discussions and interactions - Monolingual dataset focused exclusively on Russian language content This dataset offers a unique textual archive of Russian online conversations suitable for text generation, sentiment analysis, and language modeling research. Released to the public domain under CC0 1.0 license.

repliedto ajibawa-2023's post about 1 year ago

Hi All, I recently released two Audio datasets which are generated using my earlier released dataset: https://huggingface.co/datasets/ajibawa-2023/Children-Stories-Collection First Audio Dataset:https://huggingface.co/datasets/ajibawa-2023/Audio-Children-Stories-Collection-Large has 5600++ stories in .mp3 format. Second Audio Dataset:https://huggingface.co/datasets/ajibawa-2023/Audio-Children-Stories-Collection has 600 stories in .mp3 format.

View all activity

Organizations

authored 8 papers about 2 years ago

DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion

Paper • 2105.13871 • Published May 28, 2021

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

Paper • 2106.06909 • Published Jun 13, 2021 • 1

FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis

Paper • 2204.09934 • Published Apr 21, 2022

A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity

Paper • 2302.04023 • Published Feb 8, 2023

Survey of Hallucination in Natural Language Generation

Paper • 2202.03629 • Published Feb 8, 2022

SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts

Paper • 2105.03036 • Published May 7, 2021 • 2

DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis

Paper • 2309.12792 • Published Sep 22, 2023 • 1

MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24, 2024 • 47