stt-en-fastconformer-ctc-large-GGUF

GGUF quantisations of nvidia/stt_en_fastconformer_ctc_large for CrispASR.

Quant Size Description
F16 222 MB Full precision
Q8_0 132 MB 8-bit
Q5_0 95 MB 5-bit
Q4_K 83 MB 4-bit K-quant (recommended)

Architecture

18-layer NeMo FastConformer encoder + Conv1d CTC head. d_model=512, 8 heads, 1024 SentencePiece vocab, English only, 80 log-mel features, ~115M params.

Usage

crispasr --backend fastconformer-ctc -m stt-en-fastconformer-ctc-large-q4_k.gguf -f audio.wav

NeMo Family

Same --backend fastconformer-ctc supports large (18L/512d), xlarge (24L/1024d), and xxlarge (42L/1024d).

Downloads last month
130
GGUF
Model size
0.1B params
Architecture
canary-ctc
Hardware compatibility
Log In to add your hardware

5-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for cstr/stt-en-fastconformer-ctc-large-GGUF

Quantized
(1)
this model