Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Quan's picture
1 2 2

Quan

wq2012
kamilakesbi's profile picture kganjam's profile picture shuyuej's profile picture
·
https://wangquan.me/
  • wq2012

AI & ML interests

None yet

Organizations

Google's profile picture diarizers-community's profile picture Speaker, Voice & Language (SVL) team's profile picture TFLite Hub's profile picture

authored 4 papers over 1 year ago

Improved Long-Form Speech Recognition by Jointly Modeling the Primary and Non-primary Speakers

Paper • 2312.11123 • Published Dec 18, 2023

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

Paper • 2201.03713 • Published Jan 11, 2022

SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System

Paper • 2104.02125 • Published Apr 5, 2021

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Paper • 2202.12163 • Published Feb 24, 2022
authored 4 papers over 2 years ago

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Paper • 2401.03506 • Published Jan 7, 2024 • 15

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Paper • 1911.01601 • Published Nov 5, 2019

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis

Paper • 1806.04558 • Published Jun 12, 2018

Generalized End-to-End Loss for Speaker Verification

Paper • 1710.10467 • Published Oct 28, 2017
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs