Why we built Yapper — a story about a competitor we liked, until we didn’t. Read why →
Yapper
Why we built it Download for macOS
Blog · 22 April 2026 · 8 min read

The fastest dictation app for Mac in 2026 a 5-app latency benchmark

We measured perceived latency on five popular Mac dictation tools. Yapper landed at 142ms median. The closest competitor was Glido at 410ms. Here is the methodology, the table, and what it means for the way you actually work.

TL;DR: We ran 100 dictations of an identical sample sentence in five Mac dictation apps on the same MacBook Pro M3, then measured the time from key-release to the first character of injected text. Yapper hit a median of 142ms. Glido was the closest competitor at 410ms. Apple Dictation, MacWhisper and Wispr Flow all sat between 590 and 880ms median. Below: the table, the methodology, and why the difference matters more than it looks.

What “fast” actually means in dictation

The metric users feel is perceived latency — wall-clock time between releasing the dictation hotkey and seeing the first character on screen. It’s distinct from two other numbers you sometimes see in vendor marketing:

  • Transcription time — how long the model takes to convert audio to text. Many apps hide this behind a spinner.
  • Cleanup time — how long the LLM takes to remove fillers, fix grammar, and adjust tone for the active app.

Perceived latency is the only one that matters for flow. If text appears before your eyes have re-focused on the cursor, dictation feels instant. Past about 300ms it starts to feel like waiting. Past 600ms it starts to feel like a feature you reach for — which is exactly the threshold we wanted to be on the right side of when we built Yapper.

The benchmark

Each app was given the same 12-second sentence to transcribe. Hardware: MacBook Pro 14″ with the M3 Pro chip, 36GB RAM, macOS 14.4. Network: a stable 200Mbps fibre line. Each app was warmed up with five throwaway dictations before recording. Median (p50) and worst-case (p95) figures across 100 trials per app:

Appp50p95Pipeline
Yapper142ms210msLocal Whisper + deferred LLM cleanup
Glido410ms680msLocal Whisper + blocking cleanup
MacWhisper590ms920msLocal Whisper, no cleanup
Apple Dictation760ms1,100msCloud transcription, no cleanup
Wispr Flow880ms1,340msCloud Whisper + cloud cleanup

Why the slow ones are slow

Three of the five apps ship a synchronous LLM cleanup pass between transcription and injection. Cleanup is the longest pole — the large language model has to be invoked, the prompt has to be sent, and the response has to come back before any text reaches the cursor. At conventional cloud speeds that’s a 300-700ms tax on every dictation.

Yapper inverts the order. Whisper-large-v3-turbo runs locally on Apple Silicon and finishes the transcription pass in under 100ms for a 10-second utterance. The raw text hits the screen the moment the key releases. The cleanup pass runs in the background and silently replaces the inserted range about 200ms later. By the time the cleaned version arrives, you’re already reading what you said. The pattern is described in more detail on our research page.

Why latency compounds

A casual user dictates 50 times a day. A heavy user — someone using voice as their primary input across email, chat, code, and AI prompts — dictates 200 to 500 times. At 800ms per dictation across 300 dictations, that’s four minutes of waiting per day, roughly 17 hours per working year. At 142ms it’s 42 seconds per day, around three hours a year. The fourteen hours you save sound small until you realise that they’re the difference between voice feeling like a workflow and voice feeling like a feature.

Methodology

The full test harness, sample sentence and raw timing data are open. Quick summary:

  • Each app configured for English-only, default voice. No custom dictionary loaded, to keep results comparable.
  • Latency measured by 120fps screen capture, frame-stepped to find the first frame where any non-whitespace character appears in the target text field.
  • Trials spaced 5 seconds apart to avoid thermal throttling effects.
  • Same Mac, same OS version, same network, same target text field.
  • Outliers (network blips, app stalls) reported as p95 rather than discarded.

Caveats

  • Cloud-based apps are slower on a worse network. We tested on a well-provisioned residential line; corporate networks with proxies will hurt cloud apps disproportionately.
  • Local-only apps are slower on Intel Macs. We did not test these here. If you’re on Intel, expect 2-3x degradation for any local-Whisper pipeline.
  • “Cleanup” is not standardised. Apps that skip it measure faster but produce rougher text. Yapper’s deferred model gives you both: instant raw text, then a quiet polish.
  • Yapper’s Instant Mode (Max plan) is expected to push p50 below 100ms in v1.1.

What this means in practice

If you dictate a lot, the only thing worth optimising is perceived latency. Accuracy on local Whisper-v3-turbo is now within 1-2% WER of the best cloud models on English, so the historical reason to prefer cloud transcription has largely evaporated. The deciding factor is whether your dictation tool can put text on screen before you start composing the next thought.

Yapper is built around exactly that constraint. You can install it for macOS and try it on your own sentences. Free tier gives you 2,500 words to start, no card needed.

Frequently asked questions

Why is Wispr Flow slower than Apple Dictation?

Wispr Flow runs both transcription and an LLM cleanup pass in the cloud, sequentially. Apple Dictation only does transcription. The LLM step buys Wispr cleaner output but costs you ~300ms.

Is local always faster than cloud for dictation?

For latency, yes. There is no network round-trip. For accuracy, local Whisper-large-v3-turbo is now within 1-2% word-error-rate of the best cloud models on English speech, so the trade-off is mostly historical.

How do I try Yapper?

Download Yapper for macOS. You get 2,500 free words on signup and earn more by referring friends. See plans and pricing.

What hardware does Yapper need?

macOS 13 or later. Apple Silicon (M1 or newer) recommended for the sub-150ms latency target; Intel Macs work but fall back to cloud ASR.


Want to try the fastest dictation tool we measured? Download Yapper for macOS — 2,500 free words, no card. Or read the next post: /blog.

Calculate your savings.

Plug in how many words you write a day and how much your hour is worth. We’ll show you what Yapper pays back.

Assumes typing at 45 wpm vs yapping at 220 wpm. Your mileage will, charmingly, vary.
You’d save
$16,130/yr
215
hours back
730k
words shipped

Start yapping.

2,500 words on us. Refer friends to earn more — or upgrade to Pro for unlimited.