Let an AI assistant automatically open applications, fill out forms, and complete multi-step desktop tasks on your Windows PC.
Automate browser interactions where the AI reads the underlying page structure directly for more reliable clicks than pixel-matching.
Connect Claude Desktop or another MCP-compatible AI client to your Windows computer for hands-free task automation.
Requires Python 3.13+ and an MCP-compatible AI client like Claude Desktop, Windows 7-11 only, no macOS or Linux.
Windows-MCP is a Python server that gives AI agents the ability to control a Windows computer. MCP stands for Model Context Protocol, a standard way for AI assistants to connect to external tools and capabilities. When you run Windows-MCP and connect it to an AI assistant like Claude, the assistant can perform actions on your desktop: open applications, click buttons, type text, navigate folders, read what is currently on screen, and interact with web pages through a browser. The server works with any AI language model, not just one specific product. It does not rely on image recognition or screenshot analysis to find interface elements. Instead, it reads the Windows accessibility layer, which is the same infrastructure that screen readers use to help people with visual impairments navigate the operating system. This makes interactions faster and more consistent than tools that try to match pixels on screen. Typical latency between consecutive actions is between 0.2 and 0.5 seconds, depending on how many applications are running and how quickly the language model generates its next instruction. A special mode for browser automation reads the underlying web page structure directly rather than treating the browser as a visual grid, which makes web interactions more precise. Installation is a single command using the uv package manager. The server can optionally be registered as a Windows background task that starts automatically at login. Configuration instructions are included for Claude Desktop, Perplexity Desktop, and other MCP-compatible clients. The project supports Windows 7 through Windows 11, requires Python 3.13 or newer, and is licensed under MIT.
← cursortouch on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.