explaingit

yangyangwithgnu/hardseed

9,212 · C++ · Audience · general · Complexity · 3/5 · Setup · moderate

TLDR

A command-line tool written in C++ that batch-downloads images and torrent files from two specific forums, with built-in multi-proxy concurrent download support for speed.

Mindmap

mindmap
  root((repo))
    What it does
      Batch downloads images
      Fetches torrent files
      13 content categories
    Proxy Support
      Shadowsocks
      SSH tunnels
      Multi-proxy concurrent
    Filtering
      Keyword include filter
      Keyword skip filter
      Download timeout
    Platform
      Linux from source
      macOS from source
      Windows binary included

Things people build with this

USE CASE 1

Batch-download images from a specific forum category by selecting one of 13 content classes and specifying how many topic pages to scan.

USE CASE 2

Speed up downloads by supplying multiple proxies simultaneously so the tool runs concurrent fetches through each one.

USE CASE 3

Filter downloads to only topics matching a keyword, or skip topics containing words you want to avoid.
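
The include/skip filtering in use case 3 can be pictured with a small sketch. This is not hardseed's actual code: the function names `contains_any` and `should_download` are hypothetical, and the policy shown (skip words take priority, and an empty include list accepts everything) is an assumption made for illustration.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical helper: true if `title` contains at least one of `words`.
// Empty words are ignored so they cannot match every title.
bool contains_any(const std::string& title,
                  const std::vector<std::string>& words) {
    return std::any_of(words.begin(), words.end(),
                       [&](const std::string& w) {
                           return !w.empty() &&
                                  title.find(w) != std::string::npos;
                       });
}

// Hypothetical policy (an assumption, not taken from the repo):
//   1. a title containing any skip word is rejected;
//   2. with no include words, every remaining title is accepted;
//   3. otherwise the title must contain at least one include word.
bool should_download(const std::string& title,
                     const std::vector<std::string>& include_words,
                     const std::vector<std::string>& skip_words) {
    if (contains_any(title, skip_words)) {
        return false;
    }
    if (include_words.empty()) {
        return true;
    }
    return contains_any(title, include_words);
}
```

For example, `should_download("post keyword-a", {"keyword-a"}, {"keyword-b"})` accepts the topic, while a title containing both keywords is rejected because the skip list is checked first. The matching here is a plain case-sensitive substring search. How the real tool matches titles should be checked against the repo.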

Tech stack

C++

Getting it running

Difficulty · moderate
Time to first run · 1h+

Must be built from source on Linux/macOS using a C++ compiler. The target forums are blocked in China, so a working proxy is required to use the tool.

In plain English

Hardseed is a command-line tool written in C++ that batch-downloads images and torrent files from two specific Chinese-language adult content forums, CaoLiu and AiCheng. You run it from a terminal, point it at a content category, and it automatically fetches the listed posts and saves the associated files to a folder on your computer.

The tool organizes content into 13 categories, which it calls av classes. These separate the two source forums and divide the content by origin and type. You select one category per run and specify how many topic pages to scan. The program names the output folder automatically based on the category and time of download.

Because the target forums are blocked in China, the tool is built around proxy support. It accepts connections through GoAgent, Shadowsocks, SSH tunnels, and VPN setups. You can supply multiple proxies at once, and the tool runs concurrent downloads through each one simultaneously, which the README says increases download speed significantly.

Filtering options let you skip topics whose titles contain words you want to avoid, or restrict downloads to topics containing words you specify. You set a timeout for individual image downloads and control how many concurrent tasks run per proxy.

The README is written partly in Chinese and partly in English. It includes animated GIF demos showing the tool running. The repository was last actively updated around early 2016. Installation instructions cover Linux and macOS from source, and a pre-compiled Windows binary is included in the repository.
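
One way to picture the multi-proxy behavior described above is round-robin assignment: each pending download goes to the next proxy in turn, so every proxy ends up with a roughly equal share that its own workers can drain concurrently. The summary does not say how hardseed actually distributes work, so treat this as an assumed scheme. The name `assign_round_robin` and the string-based task representation are hypothetical.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical sketch: split `tasks` (for example, download URLs) into one
// bucket per proxy, handing them out in round-robin order. Bucket i would
// then be processed by the workers attached to proxy i.
// Returns an empty result when there are no proxies.
std::vector<std::vector<std::string>> assign_round_robin(
    const std::vector<std::string>& tasks, std::size_t proxy_count) {
    std::vector<std::vector<std::string>> buckets(proxy_count);
    if (proxy_count == 0) {
        return buckets;
    }
    for (std::size_t i = 0; i < tasks.size(); ++i) {
        buckets[i % proxy_count].push_back(tasks[i]);
    }
    return buckets;
}
```

With five tasks and two proxies, the first bucket receives tasks 0, 2, and 4 and the second receives tasks 1 and 3. A static split like this is simple, but a slow proxy would hold up its share. A shared work queue that idle workers pull from is a common alternative.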

Copy-paste prompts

Prompt 1
I have hardseed compiled on Linux. How do I configure it to use three Shadowsocks proxies simultaneously to maximize download speed from a specific category?
Prompt 2
Show me how to use hardseed's keyword filter flags to only download posts whose titles contain 'keyword-a' and skip any posts containing 'keyword-b'.
Prompt 3
Walk me through building hardseed from source on Ubuntu. What packages do I need to install, and what build commands do I run?

Verify against the repo before relying on details.