Scrape a website that blocks Selenium by using human-like movements and no detectable WebDriver signature
Define a typed Python schema class with CSS selectors to extract structured data from a page in a single call
Intercept browser network traffic to discover hidden API endpoints used by a website
Automate interactions inside shadow DOM elements or cross-origin iframes that other tools cannot reach
Requires Chrome or Edge browser installed on the system.
Pydoll is a Python library for automating Chrome or Edge. Unlike Selenium-style tools, it talks to the browser directly over the Chrome DevTools Protocol (CDP) rather than through a separate WebDriver program. There is therefore no extra driver binary to install, and the browser does not carry the WebDriver signature that websites commonly check to identify automated traffic.

The library is built on Python's async system, so many browser operations can run concurrently without blocking one another. It is fully type-annotated, which lets your code editor offer autocompletion and lets a type checker catch mistakes before you run anything. Mouse movements and typing mimic human behavior by default, using curved movement paths, variable timing, and small random noise, which makes automated browsing harder for anti-bot systems to spot.

For extracting data from web pages, Pydoll offers a structured approach: you define a Python class describing the fields you want (such as title, author, or tags), point each field at a CSS selector, and call a single extract method. The result is a typed Python object with all the values filled in, rather than raw HTML strings you have to parse yourself. This also works inside shadow DOM elements and cross-origin iframes, which other automation tools often cannot reach.

Network-level control is built in as well. You can intercept outgoing requests to block ads or trackers, monitor traffic to find hidden API endpoints, or make HTTP requests that carry the browser's current cookies and session state.

Installation is a single pip command with no extra dependencies. The README includes working code examples for navigation, interaction, and data extraction, and a separate documentation site covers all features in detail. The project is open source and sponsored by several web scraping service providers.
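The human-like mouse movement described above (curved paths, variable timing, small random noise) can be illustrated with a short standard-library sketch. This is not Pydoll's code or API: the function names `human_like_path` and `step_delays`, their parameters, and the choice of a quadratic Bezier curve are all assumptions made for illustration, and the library's actual implementation may differ.

```python
import random


def human_like_path(start, end, steps=25, jitter=1.5, seed=None):
    """Return points along a curved, slightly noisy path from start to end.

    Hypothetical helper for illustration only; not part of Pydoll.
    """
    rng = random.Random(seed)
    (x0, y0), (x1, y1) = start, end
    dx, dy = x1 - x0, y1 - y0

    # Control point: the midpoint pushed sideways (perpendicular to the
    # straight line) so the path bends instead of running straight.
    bend = rng.uniform(-0.3, 0.3)
    cx = (x0 + x1) / 2 - dy * bend
    cy = (y0 + y1) / 2 + dx * bend

    points = []
    for i in range(steps + 1):
        t = i / steps
        # Quadratic Bezier interpolation between start, control, and end.
        x = (1 - t) ** 2 * x0 + 2 * (1 - t) * t * cx + t ** 2 * x1
        y = (1 - t) ** 2 * y0 + 2 * (1 - t) * t * cy + t ** 2 * y1
        # Add small random noise to interior points only, so the path
        # still starts and ends exactly where requested.
        if 0 < i < steps:
            x += rng.uniform(-jitter, jitter)
            y += rng.uniform(-jitter, jitter)
        points.append((x, y))
    return points


def step_delays(steps, base=0.008, spread=0.004, seed=None):
    """Return one variable delay (in seconds) per movement step."""
    rng = random.Random(seed)
    # Gaussian timing, clamped so a delay is never zero or negative.
    return [max(0.001, rng.gauss(base, spread)) for _ in range(steps)]


if __name__ == "__main__":
    path = human_like_path((0, 0), (300, 120), steps=20, seed=1)
    delays = step_delays(20, seed=1)
    for (x, y), pause in zip(path[1:], delays):
        print(f"move to ({x:7.2f}, {y:7.2f}) after {pause * 1000:5.1f} ms")
```

An automation tool would send each point to the browser as a mouse-move event and wait the matching delay before the next one. A straight line at constant speed is an easy pattern for anti-bot systems to flag, which is why the sketch combines a bend, per-point noise, and uneven timing.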
← autoscrape-labs on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.