Analysis updated 2026-05-18
Control your Windows PC hands-free using voice commands for media, apps, and browser navigation without any internet connection.
Run multi-step agent tasks by voice, such as finding files, searching the web, or clicking on-screen elements.
Add custom voice commands for your own workflows by writing a simple Python plugin file.
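
As a rough illustration of how a drop-in plugin folder can work, the sketch below loads every `.py` file in a directory and collects the voice phrases it declares. The `COMMANDS` dictionary convention and the `load_plugins` function are hypothetical names chosen for this example; VoiceClaw's actual plugin interface is not described here and may differ.

```python
import importlib.util
from pathlib import Path
from typing import Callable, Dict


def load_plugins(folder: str) -> Dict[str, Callable[[], object]]:
    """Collect phrase -> handler pairs from every .py file in `folder`.

    Assumption: each plugin exposes a module-level dict named COMMANDS.
    This convention is invented for the sketch, not taken from VoiceClaw.
    """
    commands: Dict[str, Callable[[], object]] = {}
    for path in sorted(Path(folder).glob("*.py")):
        spec = importlib.util.spec_from_file_location(path.stem, path)
        if spec is None or spec.loader is None:
            continue  # not importable, skip it
        module = importlib.util.module_from_spec(spec)
        spec.loader.exec_module(module)  # runs the plugin file
        for phrase, handler in getattr(module, "COMMANDS", {}).items():
            commands[phrase.lower()] = handler  # match phrases case-insensitively
    return commands
```

Under this assumed convention, a plugin file would only need to define a function and list it in `COMMANDS`, for example `COMMANDS = {"say hello": greet}`. Because loading a plugin executes its code, only files you trust belong in such a folder.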
| | aryangonsalves/voiceclaw | 0marildo/imago | agentlexi/agent-lexi |
|---|---|---|---|
| Stars | 3 | 3 | 3 |
| Language | Python | Python | Python |
| Setup difficulty | moderate | easy | moderate |
| Complexity | 3/5 | 2/5 | 4/5 |
| Audience | general | general | vibe coder |
Figures from each repo's GitHub metadata at analysis time.
Requires an API key for AI agent features; local control and speech recognition work without one. The app is not code-signed and may trigger a Windows SmartScreen warning.
VoiceClaw is a lightweight always-listening voice assistant for Windows that lets you control your computer by speaking. After saying a wake word like "Hey Jarvis" or "Alexa," you can open and close applications, scroll pages, control media playback, manage files, search the web, and hand off complex multi-step tasks to an AI agent. The whole thing runs in the background and runs only wake-word detection while idle.
The system uses a four-tier approach to keep common commands instant and reserve cloud AI for genuinely difficult requests. The first tier is a learned command cache: phrases you have used before resolve immediately to their actions without checking anything else. The second tier handles simple local control commands like "open Chrome," "next video," or "volume up" with no internet and no model. The third tier routes short factual questions to a local Ollama model if one is running. The fourth tier sends complex or multi-step requests (reasoning, web lookups, file operations, or clicking on-screen elements) to a Claude or OpenAI agent using your own API key.
If you do not have an AI API key, the free tiers still cover direct PC control and on-device speech recognition. You can also switch to push-to-talk or hotkey mode if you prefer not to have a wake word running continuously.
The companion desktop app (built with PySide6) shows a dashboard with live listening status and a command tester, lets you manage wake words, microphone settings, and hotkeys, and displays a log of recent issues. A plugin system lets you add your own voice commands by dropping a Python file into a plugins folder.
VoiceClaw is available as a prebuilt Windows installer or a portable ZIP from the Releases page, or it can be run from source with Python. The README notes the app is not code-signed yet, so Windows may show a security warning on first run.
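
The four-tier routing described above can be sketched as a simple fall-through dispatcher. This is a minimal illustration, not VoiceClaw's code: `TieredRouter`, `LOCAL_COMMANDS`, the action strings, and the `is_short_question` heuristic are all hypothetical, and the sketch only decides which tier would handle a phrase rather than calling Ollama or a cloud agent.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple

# Hypothetical local command table; the real command set is not shown in the source.
LOCAL_COMMANDS = {
    "open chrome": "launch:chrome",
    "next video": "media:next",
    "volume up": "volume:+",
}

QUESTION_WORDS = ("what", "who", "when", "where", "why", "how")


def is_short_question(text: str, max_words: int = 8) -> bool:
    """Assumed heuristic: starts with a question word and is brief."""
    words = text.split()
    return bool(words) and words[0] in QUESTION_WORDS and len(words) <= max_words


@dataclass
class TieredRouter:
    local_model_available: bool = False  # e.g. a local Ollama model is running
    api_key: Optional[str] = None        # user's own Claude or OpenAI key
    cache: Dict[str, str] = field(default_factory=dict)

    def route(self, phrase: str) -> Tuple[str, str]:
        key = " ".join(phrase.lower().split())  # normalize case and spacing
        # Tier 1: learned command cache, checked before anything else.
        if key in self.cache:
            return ("cache", self.cache[key])
        # Tier 2: simple local control, no internet and no model.
        if key in LOCAL_COMMANDS:
            action = LOCAL_COMMANDS[key]
            self.cache[key] = action  # assumption: successful commands are learned
            return ("local", action)
        # Tier 3: short factual questions go to the local model if present.
        if self.local_model_available and is_short_question(key):
            return ("local_model", key)
        # Tier 4: everything else needs a cloud agent and an API key.
        if self.api_key:
            return ("cloud_agent", key)
        return ("unhandled", key)
```

Ordering the checks from cheapest to most expensive is what keeps repeated commands fast: a phrase only reaches the cloud tier after the cache, the local command table, and the local model have all declined it.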
An always-listening Windows voice agent that routes commands through on-device PC control, a local AI model, and cloud AI, letting you control your computer and run complex tasks by speaking.
Mainly Python. The stack also includes Windows and faster-whisper.
Setup difficulty is rated moderate, with roughly 30 minutes to a first successful run.
Audience: mainly general users.
Verify against the repo before relying on details.