HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation Paper • 2504.12330 • Published Apr 13, 2025 • 1
Whisper Models Dutch Language Collection This repo contains Dutch Whisper models finetuned on CV and other synthetic data, with different filtering options • 11 items • Updated Sep 16, 2025 • 2
Whisper Models Portuguese Language Collection This repo contains Whisper models trained on subsets of data such as Common Voice 17 (CV_17), Synthetic (generated by OpenAI) + CV17, and Synthetic only. • 13 items • Updated Mar 2 • 2
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Dec 10, 2025 • 22
Seamless: Multilingual Expressive and Streaming Speech Translation Paper • 2312.05187 • Published Dec 8, 2023 • 14