aufklarer commited on
Commit
1cb0565
·
verified ·
1 Parent(s): 19b10ae

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +36 -0
README.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - speech
5
+ - vad
6
+ - coreml
7
+ - voice-activity-detection
8
+ ---
9
+
10
+ # FireRedVAD — CoreML
11
+
12
+ CoreML conversion of FireRedVAD for voice activity detection on Apple Neural Engine. Supports 100+ languages with high accuracy (99.1% F1 on FLEURS).
13
+
14
+ ## Files
15
+
16
+ | File | Description |
17
+ |------|-------------|
18
+ | `fireredvad.mlmodelc` | VAD model (CoreML compiled) |
19
+ | `fireredvad.mlpackage` | VAD model (CoreML package) |
20
+ | `cmvn.json` | Feature normalization statistics |
21
+ | `config.json` | Model configuration |
22
+
23
+ ## Usage
24
+
25
+ Used by [speech-swift](https://github.com/soniqo/speech-swift) `SpeechVAD` module:
26
+
27
+ ```swift
28
+ let vad = try await FireRedVADModel.fromPretrained()
29
+ let segments = vad.detectSpeech(audio: samples, sampleRate: 16000)
30
+ ```
31
+
32
+ ---
33
+
34
+ - **Guide**: [soniqo.audio/guides/vad](https://soniqo.audio/guides/vad)
35
+ - **Docs**: [soniqo.audio](https://soniqo.audio)
36
+ - **GitHub**: [soniqo/speech-swift](https://github.com/soniqo/speech-swift)