Analysis updated 2026-05-18
Build a web crawler that makes thousands of requests without getting blocked by rotating through different IP addresses.
Collect data from websites that rate-limit or ban repeated requests from the same source.
Test your application's behavior when accessed from different geographic locations using proxy rotation.
| jhao104/proxy_pool | deepseek-ai/deepseek-coder | vanna-ai/vanna | |
|---|---|---|---|
| Stars | 23,317 | 23,251 | 23,386 |
| Language | Python | Python | Python |
| Setup difficulty | moderate | moderate | moderate |
| Complexity | 3/5 | 3/5 | 3/5 |
| Audience | developer | developer | pm founder |
Figures from each repo's GitHub metadata at analysis time.
Requires Docker to run Redis and the proxy service, API key or proxy source configuration may be needed.
proxy_pool is a Python tool that automatically gathers and maintains a pool of free proxy IP addresses for use in web scraping. A proxy is an intermediary server that lets your program fetch web pages using a different IP address, which helps avoid being blocked by websites that restrict repeated requests from the same source. The tool works in two main parts running at once: a scheduler that continuously collects proxy addresses from several free proxy listing websites, verifies whether each one actually works, and stores the valid ones in a Redis database (a fast in-memory data store), and a small web API server that your scraper can query to get a working proxy on demand. You interact with it through simple web endpoints, ask for a random proxy, fetch all proxies, check the count, or delete a bad one. When a proxy stops working, your scraper can report it for removal and request a fresh one. The system also lets you add your own proxy sources if the built-in free ones are not reliable enough for your needs. You would use this when building a web crawler or data collection script that needs to make many requests without triggering rate limits or bans. It is written in Python and uses Redis for storage, and can be run directly or via Docker.
Automatically collects, verifies, and serves working proxy IP addresses for web scraping via a simple API, storing them in Redis.
Mainly Python. The stack also includes Python, Redis, Docker.
Use freely for any purpose including commercial, as long as you keep the copyright notice.
Setup difficulty is rated moderate, with roughly 30min to a first successful run.
Mainly developer.
This repo across BitVibe Labs
Verify against the repo before relying on details.