VOICE TO TEXT

Privacy-first. Blazing fast. Hold a key, speak, release — text appears where you need it.

Built Different

Hold → Speak → Release

Global hotkey captures audio instantly. Release to transcribe. Text injected into your active app.

Your Machine. Your Data.

All transcription runs locally. Audio and logs never leave your device unless you explicitly opt in.

1.5s for 30s Audio

Written in native code with GPU acceleration. The fastest local transcription engine available.

Optional Online Services

Use your own API key for cloud LLMs. Enable or disable at any time — your choice, always.

Backend Flexibility

Auto-detect, CPU-only, or GPU acceleration. Choose the model size that fits your hardware.

LLM-Powered Accuracy

Route audio through Gemini, GPT, or specialized models for ultra-accurate transcription of technical content.

Raw Speed

30 seconds of English audio

OiPer Desktop1.5s
Lemonfox API3.27s
Python Faster-Whisper3.55s
OpenAI Whisper 1 API6.46s

Privacy Is Not Optional

Every transcription runs on your hardware. Your audio never leaves your machine. Activity logs stay local. Online services are available — but only when you choose, with your own API keys.

Local Transcription

Runs entirely on your CPU or GPU

No Telemetry

Zero data collection by default

Your API Keys

Online features use your own credentials

codex 1codex 2codex 3codex 4codex 5
codex2 1codex2 2codex2 3codex2 4codex2 5
gemini 1gemini 2gemini 3gemini 4gemini 5
glm 1glm 2glm 3glm 4glm 5
sonnet 1sonnet 2sonnet 3sonnet 4sonnet 5
kimi 1kimi 2kimi 3kimi 4kimi 5
codex53 1codex53 2codex53 3codex53 4codex53 5
opus 1opus 2opus 3opus 4opus 5
qwen 1qwen 2qwen 3qwen 4qwen 5
sonnet2 1sonnet2 2sonnet2 3sonnet2 4sonnet2 5
lovable 1lovable 2lovable 3lovable 4lovable 5
v0 1v0 2v0 3v0 4v0 5