Learn web scraping fundamentals by studying and running real working examples.
Bulk-download content such as music, videos, or novels from Chinese websites for personal use.
Practice Python HTTP requests, HTML parsing, and automation skills with concrete targets.
Understand how to handle common scraping challenges like captchas and proxy rotation.
This repository is a collection of practical Python 3 web scraping examples aimed at learners. Web scraping means writing code that automatically visits a website and extracts data from it, such as downloading all the images from a photo site, pulling music from a streaming platform, or grabbing product listings from an online store. The project provides ready-to-run example scripts for a variety of Chinese websites and services, giving Python beginners hands-on scraping practice.

The included scripts cover a wide range of real-world targets: downloading novels from a web fiction site, bulk-downloading music from a Chinese streaming service, scraping Bilibili videos and comments, downloading TikTok (Douyin) videos without watermarks, solving GEETEST slider captchas, building a proxy IP pool, downloading manga chapters, fetching financial reports, and running a basic train ticket booking helper for China's 12306 system. Each script is a self-contained demonstration using common Python libraries like Requests, BeautifulSoup, and Scrapy.

The repository suits readers who are learning Python web scraping and want concrete, working examples rather than abstract tutorials. The README is written in Chinese, and the scripts are for educational purposes only: the author explicitly disclaims commercial use and notes that scraping in violation of site terms of service carries legal risk in China.
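To make the "visit a page and extract data" idea concrete, here is a minimal sketch of the photo-site case: fetch a page, then collect the URL of every image on it. It is not taken from the repository. The repository's scripts are described as using Requests, BeautifulSoup, and Scrapy, whereas this sketch swaps in only the Python standard library so it runs without installing anything. The names `ImageLinkParser`, `extract_image_urls`, and `fetch_html`, the sample HTML, and the `example.com` URLs are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import Request, urlopen


class ImageLinkParser(HTMLParser):
    """Collects the src of every <img> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:  # skip <img> tags that have no src attribute
            self.image_urls.append(urljoin(self.base_url, src))


def extract_image_urls(html, base_url):
    """Return absolute image URLs found in an HTML string."""
    parser = ImageLinkParser(base_url)
    parser.feed(html)
    return parser.image_urls


def fetch_html(url, timeout=10):
    """Download a page as text. Many sites reject requests without a User-Agent."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0 (learning example)"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


# Hypothetical page content, used here so the sketch runs offline.
SAMPLE_HTML = """
<html><body>
  <img src="/photos/1.jpg" alt="first">
  <img src="https://cdn.example.com/2.png">
  <img alt="no source">
</body></html>
"""

if __name__ == "__main__":
    for url in extract_image_urls(SAMPLE_HTML, "https://example.com/gallery/"):
        print(url)
```

Against a live site, the parsing step would be fed `fetch_html(page_url)` instead of the sample string. Whether that is permitted depends on the site's terms of service, as the author's disclaimer notes.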
Generated 2026-05-18 · Model: sonnet-4-6 · Verify against the repo before relying on details.