stt-en-fastconformer-ctc-large-GGUF
GGUF quantisations of nvidia/stt_en_fastconformer_ctc_large for CrispASR.
| Quant | Size | Description |
|---|---|---|
| F16 | 222 MB | Full precision |
| Q8_0 | 132 MB | 8-bit |
| Q5_0 | 95 MB | 5-bit |
| Q4_K | 83 MB | 4-bit K-quant (recommended) |
Architecture
18-layer NeMo FastConformer encoder + Conv1d CTC head. d_model=512, 8 heads, 1024 SentencePiece vocab, English only, 80 log-mel features, ~115M params.
Usage
crispasr --backend fastconformer-ctc -m stt-en-fastconformer-ctc-large-q4_k.gguf -f audio.wav
NeMo Family
Same --backend fastconformer-ctc supports large (18L/512d), xlarge (24L/1024d), and xxlarge (42L/1024d).
- Downloads last month
- 130
Hardware compatibility
Log In to add your hardware
5-bit
8-bit
16-bit
Model tree for cstr/stt-en-fastconformer-ctc-large-GGUF
Base model
nvidia/stt_en_fastconformer_ctc_large