AI models in Vowen
Vowen runs best-in-class speech models — on your device for privacy and offline use, or in the cloud for maximum speed. Pick the one that fits the job.
How to read this
Two kinds of models, one app
On-device models run entirely on your computer. Your audio never leaves the machine and you don't need an internet connection — ideal for private, offline, or regulated work. Cloud models send audio to a provider for the fastest possible turnaround or a specific accuracy profile. Vowen supports both, and you choose per situation.
On-device
Private, offline transcription
| Model | Provider | Runs | Best for |
|---|---|---|---|
| Parakeet TDT v3 | NVIDIA | Fastest on-device transcription; great real-time dictation on Apple Silicon. | |
| Parakeet TDT v2 | NVIDIA | Proven, low-latency model for live voice typing. | |
| Parakeet CTC 0.6B | NVIDIA | Lightweight rescorer used for custom-vocabulary accuracy. | |
| Whisper Large v3 | OpenAI | Highest on-device accuracy across 100+ languages. | |
| Whisper Large v3 Turbo | OpenAI | Near-large accuracy at a fraction of the compute. | |
| Whisper (small / medium) | OpenAI | Smaller footprints for older or lower-RAM machines. |
Cloud
Fastest turnaround, by choice
| Model | Provider | Runs | Best for |
|---|---|---|---|
| Whisper Large v3 / Turbo | Groq | Extremely fast cloud transcription via Groq's LPU. | |
| Nova-3 / Nova-2 | Deepgram | Fast, accurate streaming with strong punctuation. | |
| Scribe | ElevenLabs | High-accuracy transcription with robust formatting. | |
| Ink-Whisper | Cartesia | Low-latency real-time streaming for live dictation. | |
| Soniox | Soniox | Multilingual transcription with speaker context. | |
| Voxtral | Mistral | Open-weight model with strong multilingual coverage. | |
| gpt-4o-transcribe | OpenAI | Frontier-model accuracy for tough audio. |
On-device by default, cloud by choice
Vowen defaults to processing on your machine. If you never want audio to leave your device, pick an on-device model and you're fully offline — no account, no upload, nothing stored on a server.
Support
Common questions.
What's the difference between Parakeet and Whisper?
Do I need an internet connection?
Which model is the most accurate?
Are cloud models private?
Can I switch models per task?
One app, every model.
Run speech recognition your way — on-device or in the cloud. Free tier that doesn't expire.