Hold a global hotkey. Speak. Release.
Whisper processes audio locally — no cloud, no lag, no data leaks.
Text injects directly into any active application.
Press-and-hold activation across every app, every window. No context switch.
Whisper runs on your hardware. Audio never leaves the machine unless you choose.
CUDA and Metal supported. Auto-detect or force-select your backend.
Gemini Flash Lite, local models, any OpenAI-compatible API for text polish.
Transcribed text lands in your focused input. No clipboard. No paste.
Choose Whisper model size. Download, switch, and delete from built-in UI.
Tested on 30 seconds of English audio. Native code wins — every time.
Recording is captured and processed locally. Never persisted by default.
Whisper runs on your CPU or GPU. No API calls, no data egress, ever.
Text polishing via cloud is entirely optional. Requires your own API key.
Fast, local, and entirely under your control.