Scrape websites protected by Cloudflare or Datadome using anti-detection browser automation.
Package a finished scraper as a desktop app or web interface so non-technical clients can run it themselves.
Cut proxy costs by routing some requests directly through the browser instead of an external proxy service.
Run parallel scraping tasks with result caching to avoid re-fetching pages you have already processed.
Install via pip, some anti-detection features require a compatible browser driver to be present.
Botasaurus is a Python framework for building web scrapers that can get past bot-detection systems. Many websites use tools like Cloudflare, Fingerprint, and Datadome to identify and block automated traffic. Botasaurus is specifically designed to avoid triggering these systems by making the browser behave more like a real human, including realistic mouse movements and other signals that detection tools look for. The framework wraps around browser automation (similar to Selenium or Playwright) and HTTP request libraries, but adds a layer that masks the signs of automation. You write a Python function, add a decorator like @browser or @request to it, and Botasaurus handles the rest: launching the browser with anti-detection settings, managing browser profiles, rotating proxies, and saving results automatically as JSON files. A basic scraper that opens a page and extracts a heading can be written in about ten lines of code. Beyond detection bypassing, Botasaurus includes features aimed at reducing the cost and complexity of scraping at scale. It claims to cut proxy costs by up to 97% by sending some requests directly from within the browser rather than routing all traffic through an external proxy server. It also supports running multiple scraping tasks in parallel, caching results to avoid refetching the same data, and distributing work across multiple machines using Kubernetes. One notable feature is the ability to package a finished scraper as a desktop application for Windows, macOS, or Linux, or to turn it into a web interface that non-technical users can operate through a browser. This is aimed at developers who build scraping tools for clients or customers who do not want to use a command line. Installation is through pip. The project is split across several packages (botasaurus, botasaurus-api, botasaurus-driver, botasaurus-server, and others) that can be installed and upgraded together. The full README is longer than what was shown.
← omkarcloud on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.