hold to speak • free and open source • never leaves your mac
Transcription in less than a second.
Free and open source. Regular automatic updates.
I use it daily and barely type anymore.
Proudly vibe coded. Let's improve it together →
| speakfree | Superwhisper | VoiceInk | Wispr Flow | Apple Dictation | |
|---|---|---|---|---|---|
| Price | Free | $249.99 | Free / $39.99 | $15/mo | Built-in |
| License | MIT | Closed | GPL v3 | Closed | — |
| Stays on your Mac | Always | Yes | Mostly | No — cloud only | M1+ only |
| Works on remote desktop | ✓ automatic | ✗ | ✗ | Paste manually | ✗ |
| Latency | under 1s | not published | not published | 1–2s (cloud) | variable |
| Account required | No | Yes | No | Yes | Apple ID |
| Works offline | Always | Yes | Yes | Never | Yes |
Yes — completely free and MIT licensed. No subscription, no trial, no premium tier. The source code is public and anyone can inspect it, fork it, or contribute. If you want to support development, there's a tip link on GitHub.
Never. Audio goes to a temporary file, gets transcribed locally by whisper.cpp, and is deleted. There are zero network requests during normal use — the only internet connection is the one-time model download on first launch.
Apple Silicon Macs (M1 or later) running macOS 14 Sonoma or later. Whisper runs on the GPU via Metal — Intel Macs are not supported.
Most dictation tools use a single paste strategy everywhere. That works for simple text fields but breaks silently in Electron apps like Slack, VS Code, and Discord; in web apps like Google Docs; and in remote desktops where the clipboard never reaches the remote machine.
speakfree detects what kind of app your cursor is in and picks the right approach automatically: Accessibility API for native Mac apps, clipboard paste (with your previous contents restored) for Electron apps, and keyboard emulation via AppleScript for remote desktops like Splashtop, TeamViewer, and Citrix. Other tools either ask you to configure this manually or just tell you to "paste manually" in remote sessions. speakfree handles it automatically.
Apple Dictation cuts off after 30–60 seconds of silence and struggles with technical terms, code, and anything that isn't plain English prose. Whisper is significantly more accurate — it handles jargon, acronyms, and proper nouns reliably. speakfree also gives you true push-to-talk (hold to record, release to insert), which Apple's built-in dictation doesn't support.
Yes — Whisper supports 99 languages. Go to Settings, pick your language, and speakfree will download the right multilingual model automatically. You can also set it to auto-detect the language from your speech.
http://localhost:5765