k2-fsa

non-profit

https://github.com/k2-fsa/

AI & ML interests

FSA/FST algorithms, differentiable, with PyTorch compatibility. Automatic speech recognition

Recent Activity

zhu-han new activity 1 day ago

k2-fsa/TTS_eval_models:Add library_name, paper links, and citation

zhu-han new activity 1 day ago

k2-fsa/OpenDialog:Add task categories, language tags, and project links

yfyeung authored a paper 6 days ago

Representation-Regularized Convolutional Audio Transformer for Audio Understanding

View all activity

k2-fsa 's Spaces 50

Web Assembly Speaker Diarization Sherpa Onnx

Speaker diarization with Next-gen Kaldi and WebAssembly

Web Assembly Asr Sherpa Onnx En

Convert spoken words into text

Generate subtitles

Generate SRT subtitles from video or audio files

Automatic Speech Recognition

Transcribe uploaded, recorded, or online audio to text

Generate subtitles

Generate subtitles (SRT) from video or audio files

Web Assembly Vad Asr Sherpa Onnx Zh Zipformer Ctc

vad asr with zipformer ctc

Web Assembly Vad Asr Sherpa Onnx Zh En Paraformer

Convert speech to text in real-time with voice activity detection

Web Assembly Vad Asr Sherpa Onnx Th Zipformer

Convert speech to text in real-time using your microphone

Web Assembly Vad Asr Sherpa Onnx Ja Zipformer

Transcribe audio to text using voice activity detection

tts Text To Speech

Text-to-speech (TTS) with Next-gen Kaldi

Web Assembly Asr Sherpa Ncnn Zh En

Convert spoken words into text

Web Assembly Vad Asr Sherpa Onnx Zh Telespeech

Transcribe spoken Chinese into text

Source Separation

Source separation

Web Assembly Asr Sherpa Onnx Zh En Jp Ko Cantonese

Transcribe audio to text in multiple languages

Automatic Speech Recognition

Transcribe audio to text in various languages

Spoken Language Identification

Identify spoken language from audio

Audio Tagging

Tag audio files to identify events

Speaker Diarization

Speaker diarization, speake segmentation,

Text To Speech

Transcribe audio to text

Web Assembly Asr Sherpa Ncnn En

Convert spoken words into text using voice input