Cua (trycua/cua) is an open-source toolkit for building, testing, and running AI agents that can operate full desktop computers, clicking, typing, taking screenshots, and completing tasks autonomously across macOS, Linux, Windows, and Android. The project is structured as several interconnected tools. The core component is a sandbox SDK (a safe, isolated environment) that lets you spin up virtual machines or containers running any operating system, then programmatically control them. A code example shows creating a Linux sandbox and then running shell commands, taking screenshots, clicking at coordinates, and typing text, all with the same Python API regardless of which OS you target. Sandboxes can run locally via QEMU (a virtualization tool) or in the cloud. Cua Driver handles background computer-use on macOS, an AI agent can click and type in native apps without interrupting the user, even in apps where normal accessibility shortcuts don't work. CuaBot is a command-line tool that gives any AI coding agent a sandbox to work in, with support for browser control, mobile device simulation, and screenshot capture. Cua-Bench provides benchmarks and testing environments for measuring how well computer-use agents perform on standardized tasks, and can export recordings for training AI models. Lume handles macOS and Linux virtual machine management on Apple Silicon hardware. The project is written primarily in HTML with Python SDKs and suits AI researchers, developers building automation, and teams evaluating computer-use AI models.
Generated 2026-05-21 · Model: sonnet-4-6 · Verify against the repo before relying on details.