---
license: apache-2.0
tags:
- automatic-speech-recognition
- whisper
- windyword
- english
- multilingual
library_name: transformers
pipeline_tag: automatic-speech-recognition
language:
- en
- multilingual
---

# WindyWord.ai STT — Windy Lite

**Multilingual speech-to-text engine. Transcribes audio in 100+ languages, with English as the primary trained domain.**

## Profile

- **Architecture:** 74M params · whisper-base
- **Profile:** fast
- **Base model:** [openai/whisper-base](https://huggingface.co/openai/whisper-base)

## Variants in this repo

| Subfolder | Format | Use case |
|---|---|---|
| `safetensors/` | PyTorch safetensors (FP32) | GPU inference (highest quality) |
| `ct2-int8/` | CTranslate2 INT8 | CPU inference (~25% size, 2-4× faster) |
| `onnx/` | ONNX FP32 | Cross-platform deployment |
| `onnx-int8/` | ONNX INT8 | Edge / mobile / WebAssembly |

## Usage

```python
from transformers import WhisperForConditionalGeneration, WhisperProcessor
processor = WhisperProcessor.from_pretrained("WindyWord/listen-windy-lite", subfolder="safetensors")
model = WhisperForConditionalGeneration.from_pretrained("WindyWord/listen-windy-lite", subfolder="safetensors")
```

For CPU inference via CTranslate2:
```python
import ctranslate2
# After downloading the ct2-int8 subfolder:
model = ctranslate2.models.Whisper("path/to/ct2-int8/")
```

## Commercial Use

Part of the [WindyWord.ai](https://windyword.ai) STT fleet. Visit windyword.ai for real-time voice-to-text + translation apps and API access.

---

## Provenance & License

Weights derived from [openai/whisper-base](https://huggingface.co/openai/whisper-base) under Apache-2.0 (inherited). Voice tiers are direct redistributions of the upstream community Whisper / distil-whisper variants; no LoRA fine-tuning has been applied to these voice models.

*Certified by Opus 4.6 Opus-Claw (Dr. C) on Veron-1 (RTX 5090, Mt Pleasant SC).*