Question 1

What's the difference between Parakeet and Whisper?

Accepted Answer

Both run on your device. Parakeet (from NVIDIA) is optimized for speed and excels at real-time dictation, especially on Apple Silicon. Whisper (from OpenAI) tends to be the most accurate across a very wide range of languages and accents. Vowen lets you pick based on what you value most.

Question 2

Do I need an internet connection?

Accepted Answer

No. With an on-device model (Parakeet or Whisper), Vowen transcribes entirely on your machine — no internet, and your audio never leaves the device. Cloud models are optional, for when you want maximum speed or a specific provider.

Question 3

Which model is the most accurate?

Accepted Answer

For most languages, Whisper Large v3 on-device and frontier cloud models like gpt-4o-transcribe are at the top for accuracy. For speed, Parakeet on-device and Groq's Whisper in the cloud are hard to beat. The best choice depends on your hardware and whether you need offline.

Question 4

Are cloud models private?

Accepted Answer

On-device models keep everything local by default. If you choose a cloud model, audio is sent to that provider only to produce the transcript. For sensitive or regulated work, use an on-device model to keep audio on your machine.

Question 5

Can I switch models per task?

Accepted Answer

Yes. You can choose different models for different situations — a fast on-device model for quick dictation, or a higher-accuracy model for important transcripts.

Model	Provider	Runs	Best for
Parakeet TDT v3	NVIDIA	On-device	Fastest on-device transcription; great real-time dictation on Apple Silicon.
Parakeet TDT v2	NVIDIA	On-device	Proven, low-latency model for live voice typing.
Parakeet CTC 0.6B	NVIDIA	On-device	Lightweight rescorer used for custom-vocabulary accuracy.
Whisper Large v3	OpenAI	On-device	Highest on-device accuracy across 100+ languages.
Whisper Large v3 Turbo	OpenAI	On-device	Near-large accuracy at a fraction of the compute.
Whisper (small / medium)	OpenAI	On-device	Smaller footprints for older or lower-RAM machines.

Model	Provider	Runs	Best for
Whisper Large v3 / Turbo	Groq	Cloud	Extremely fast cloud transcription via Groq's LPU.
Nova-3 / Nova-2	Deepgram	Cloud	Fast, accurate streaming with strong punctuation.
Scribe	ElevenLabs	Cloud	High-accuracy transcription with robust formatting.
Ink-Whisper	Cartesia	Cloud	Low-latency real-time streaming for live dictation.
Soniox	Soniox	Cloud	Multilingual transcription with speaker context.
Voxtral	Mistral	Cloud	Open-weight model with strong multilingual coverage.
gpt-4o-transcribe	OpenAI	Cloud	Frontier-model accuracy for tough audio.

AI models in Vowen

Two kinds of models, one app

Private, offline transcription

Fastest turnaround, by choice

On-device by default, cloud by choice

Common questions.

Where to go next

Transcribe audio to text

Free voice tools

Compare Vowen

One app, every model.