I built a fully offline, FOSS Push-to-Talk dictation tool for Windows because the good ones are Mac-exclusive (C#)
https://reddit.com/link/1tgta4x/video/rabket39ix1h1/player
Hey everyone,
I’ve been watching a lot of workflow videos lately, and there’s a recurring theme: typing is becoming the main bottleneck when interacting with AI or doing heavy code reviews. I wanted a local speech-to-text tool to fix this, but the landscape on Windows is frustrating:
- Built-in tools are clunky.
- The best local FOSS tools are macOS-exclusive.
- The rest are heavy Electron/Python apps that eat up too much RAM just to run in the background.
So, I built Echo - a completely free, open-source C# utility.
What it does: It's a minimal Push-to-Talk app. You hold a global hotkey, speak, and Echo transcribes your audio using a local Whisper model. If enabled in the settings, it instantly pastes the text into your active window (IDE, browser, Teams, Slack).
Why I think it fits here:
- 100% Private & Offline: Zero telemetry, no cloud APIs. Audio never leaves your machine.
- Zero UI Bloat: It runs quietly in the console.
- VAD Built-in: Uses Voice Activity Detection to drop empty audio, fixing those infamous Whisper hallucinations.
- Hardware Choice: Supports CUDA/Vulkan for fast inference, with a CPU fallback.
- Seamless Translation: You can speak in your native language, and Whisper will translate and type it seamlessly in English.
- Highly Customizable: Tweak hotkeys, volume, VAD sensitivity, and Whisper settings via config. You also choose which GGML model size fits your hardware (check the comparison table on the repo!).
System Requirements:
- OS: Windows 10 or 11.
- Hardware: Highly scalable. You can comfortably run smaller models (
baseorsmall) strictly on your CPU. However, for near-instant inference with larger models, a dedicated GPU is recommended (CUDA and Vulkan are fully supported). - Memory (RAM/VRAM): Depends entirely on the model you choose — ranging from ~500 MB for the
tinymodel up to ~6 GB+ forlarge. - Storage: The app itself is incredibly lightweight, but expect to use between 150 MB and 3 GB of disk space for the downloaded Whisper
.binmodels.
Links:
- 💻 GitHub: https://github.com/GithubPhobos/Echo
Feel free to grab the code, tear it apart, or use it for your daily workflow!