Analysis updated 2026-06-24
Record a robot that scrapes daily product prices from an ecommerce site into Google Sheets
Convert a documentation site to clean Markdown for feeding into a RAG pipeline
Crawl a competitor blog and push every new post into Airtable
Self host Maxun with Docker and call its REST API from another app
| getmaxun/maxun | microsoft/data-formulator | infinitered/reactotron | |
|---|---|---|---|
| Stars | 15,564 | 15,549 | 15,543 |
| Language | TypeScript | TypeScript | TypeScript |
| Setup difficulty | moderate | moderate | easy |
| Complexity | 3/5 | 3/5 | 2/5 |
| Audience | data | data | developer |
Figures from each repo's GitHub metadata at analysis time.
Self hosting needs Docker plus credentials for any destination like Google Sheets or Airtable.
Maxun is an open-source, no-code web data platform written in TypeScript that lets you extract structured data from websites without writing code. It works by recording your browser actions and turning them into reusable automated "robots" that can collect data on a schedule. The platform supports four main types of robots. Extract robots emulate real user behavior and can operate in either Recorder Mode (where you demonstrate what to click and collect, and Maxun remembers it) or AI Mode (where you describe in plain language what data you want). Scrape robots convert full web pages into clean Markdown or HTML, useful for feeding content into AI workflows. Crawl robots visit every page of a website and collect content from each one. Search robots run automated web searches with optional time-based filters. Extracted data can be exported to Google Sheets, Airtable, or served as a RESTful API (a structured data endpoint). The platform handles pagination, scrolling, website layout changes, and authentication. A developer SDK and command-line tool are also available for programmatic use. Maxun can be used via the hosted app at maxun.dev or self-hosted using Docker. It is licensed under the AGPLv3 open-source license.
Open source no-code web scraping platform. Record browser actions or describe what you want, then schedule robots that export to Sheets, Airtable, or a REST API.
Mainly TypeScript. The stack also includes TypeScript, Docker, Playwright.
AGPLv3. You can use and modify it for free but if you offer it as a hosted service to others you must publish your changes.
Setup difficulty is rated moderate, with roughly 30min to a first successful run.
Mainly data.
This repo across BitVibe Labs
Verify against the repo before relying on details.