Transcription Models

Overview of local and cloud transcription models — speed, accuracy, and privacy trade-offs.

VivaDicta supports a wide range of transcription models — from fully local and private to lightning-fast cloud services.

Local Models

Local models run entirely on your Mac. No internet required, no data leaves your device.

Whisper

  • OpenAI's Whisper model running locally on your Mac.
  • Multiple model sizes: Tiny, Base, Small, Medium, Large, Large Turbo.
  • Larger models are more accurate but require more RAM and processing time.
  • Best on Apple Silicon Macs (M1 and later).
  • Supports 100+ languages.

Parakeet

  • NVIDIA's Parakeet model optimized for Apple Silicon.
  • Supports 25+ languages with automatic language detection (v3).
  • Fast and accurate — a great alternative to Whisper.

Apple Speech

  • Uses macOS built-in speech recognition.
  • No model download needed — works immediately.
  • Good for quick dictation, less accurate for longer recordings.

Cloud Models

Cloud models send your audio to a remote server for transcription. They're typically faster and more accurate than local models, especially on older Macs.

ProviderFree TierStrengthsAPI Key
GroqFree forever (rate-limited)Fastest cloud transcriptionGet key
Deepgram$200 free creditsHigh accuracy (Nova-3)Get key
ElevenLabsFree tier (limited)Great multi-language (Scribe 2)Get key
Mistral (Voxtral)Free Experiment planOpen-source, multilingualGet key
Google GeminiFree tier + $300 creditsGoogle ecosystemGet key
SonioxCheck their siteReal-time, diarizationGet key
OpenAIPay-as-you-goOriginal Whisper APIGet key

How to Choose

  • Want free & fast? Groq — our #1 recommendation.
  • Want maximum privacy? Local Whisper Large Turbo or Parakeet.
  • Want best accuracy? Deepgram Nova-3 or ElevenLabs Scribe 2.
  • On an older Mac? Cloud models offload compute — try Groq.

See Recommended Models for detailed recommendations by use case.