Collections
Collections including paper arxiv:2303.00747
- WhisperX: Time-Accurate Speech Transcription of Long-Form Audio
  Paper • 2303.00747 • Published • 6
- Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion
  Paper • 2311.14836 • Published • 2
- SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
  Paper • 2308.11466 • Published • 1
- W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training
  Paper • 2108.06209 • Published • 1