dondai1234/agent-browser

Analysis updated 2026-05-18

★ 20GoAudience · developerComplexity · 2/5LicenseSetup · easy

Mindmap

mindmap
  root((agent-browser))
    What it does
      Controls a browser for AI
      Returns deltas not full trees
      Reports action verdicts
    Tools available
      act click fill select
      login single and multi-step
      js structured data
      see nav scroll history
    Tech Stack
      Go single binary
      chromedp engine
      MCP server
    Use Cases
      AI web automation
      Login handling
      Data scraping
      Form filling

mindmap root((agent-browser)) What it does Controls a browser for AI Returns deltas not full trees Reports action verdicts Tools available act click fill select login single and multi-step js structured data see nav scroll history Tech Stack Go single binary chromedp engine MCP server Use Cases AI web automation Login handling Data scraping Form filling

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Connect an AI assistant to a real browser so it can log into sites, fill forms, and extract page data automatically.

USE CASE 2

Build an AI agent that navigates the web using a fraction of the tokens that Playwright MCP would cost.

USE CASE 3

Automate multi-step login flows, including sites like Google that show password on a separate screen, from one tool call.

What is it built with?

GochromedpMCP

How does it compare?

	dondai1234/agent-browser	abolix/xplex	pivanov/ctx-wire
Stars	20	20	20
Language	Go	Go	Go
Setup difficulty	easy	hard	easy
Complexity	2/5	3/5	2/5
Audience	developer	ops devops	developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 5min

Requires downloading the binary and adding it to your MCP config file.

MIT license: use, modify, and distribute freely for any purpose including commercial use.

In plain English

agent-browser is a browser automation tool built for AI agents, not for humans to operate directly. It runs as a single Go binary with no Node.js, Playwright, or Puppeteer required, and it connects to AI assistants through the Model Context Protocol (MCP). When an AI needs to browse the web, fill a form, or pull data from a page, it calls this tool, which carries out the action and reports back a compact summary of what changed. The main design goal is reducing how many tokens the AI spends interpreting the browser. Other browser MCP servers send verbose accessibility trees after every action, which can cost tens of thousands of tokens per page. agent-browser sends compact reference lines and only what changed since the last step. On a measured login task, it used about 154 tokens compared to roughly 1,500 for Playwright MCP. It also exposes only 9 tools versus 22 or more for competing options, so connecting costs fewer tokens too. The tool gives the AI nine actions: act handles clicking, filling, and selecting any control by describing it in plain words, js runs JavaScript on the page and returns structured data as JSON rather than a list of page references, login handles a full sign-in sequence including multi-step flows like those used by Google and Microsoft, see takes a page snapshot, nav goes to a URL, and scroll, mark, and history round out the set. The login tool checks the resulting page state rather than just whether a click went through, so it distinguishes a successful login from a wrong-password error or a two-factor prompt. Version 3.2 added automatic cookie banner dismissal on every page load, support for a type of custom dropdown that standard accessibility tools cannot reach, and corrections to browser signals that can reveal a headless browser to bot-detection systems. The tool is licensed under MIT and installs as a single binary on macOS, Windows, and Linux. Configuration involves adding it to an MCP config file.

Copy-paste prompts

Prompt 1

Using agent-browser MCP, write a Claude agent that logs into example.com with a given username and password, then scrapes all table data from the dashboard and returns it as JSON.

Prompt 2

I have agent-browser connected to my AI assistant. Show me how to use the js tool to extract a table from a webpage without re-snapshotting the page.

Prompt 3

How do I add agent-browser to my MCP config file so that Claude Code or Cursor can control a browser?

Prompt 4

Using agent-browser's act tool, write a sequence of steps to search for a product on an e-commerce site and add the first result to the shopping cart.

Frequently asked questions

What is agent-browser?

A browser automation MCP server for AI agents that uses roughly 10x fewer tokens than Playwright, running as a single Go binary with no Node.js required.

What language is agent-browser written in?

Mainly Go. The stack also includes Go, chromedp, MCP.

What license does agent-browser use?

MIT license: use, modify, and distribute freely for any purpose including commercial use.

How hard is agent-browser to set up?

Setup difficulty is rated easy, with roughly 5min to a first successful run.

Who is agent-browser for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub dondai1234 on gitmyhub

Verify against the repo before relying on details.