Web Assembly Speaker Diarization Sherpa Onnx
Speaker diarization with Next-gen Kaldi and WebAssembly
FSA/FST algorithms, differentiable, with PyTorch compatibility. Automatic speech recognition
Speaker diarization with Next-gen Kaldi and WebAssembly
Convert spoken words into text
Generate SRT subtitles from video or audio files
Transcribe uploaded, recorded, or online audio to text
Generate subtitles (SRT) from video or audio files
vad asr with zipformer ctc
Convert speech to text in real-time with voice activity detection
Convert speech to text in real-time using your microphone
Transcribe audio to text using voice activity detection
Text-to-speech (TTS) with Next-gen Kaldi
Convert spoken words into text
Transcribe spoken Chinese into text
Source separation
Transcribe audio to text in multiple languages
Transcribe audio to text in various languages
Identify spoken language from audio
Tag audio files to identify events
Speaker diarization, speake segmentation,
Transcribe audio to text
Convert spoken words into text using voice input