explaingit

cv-cat/spider_xhs

5,821JavaScriptAudience · developerComplexity · 3/5LicenseSetup · moderate

TLDR

A scraper and publisher for Xiaohongshu (Little Red Book) that collects posts, comments, and videos, and auto-publishes content, built as a foundation for AI-powered social media automation workflows.

Mindmap

mindmap
  root((repo))
    What it does
      Scrapes posts and comments
      Downloads images and videos
      Publishes content
    Data Sources
      Regular XHS website
      Creator platform
      Influencer marketplace
    AI Workflows
      Competitor content rewriting
      Keyword trend monitoring
      Influencer scoring
    Setup
      Python 3.10 plus
      Node.js 20 plus
      Browser cookie auth
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Collect posts, comments, and engagement data from Xiaohongshu accounts or search results for research

USE CASE 2

Download Xiaohongshu images and videos without watermarks for archival or analysis

USE CASE 3

Build an AI content pipeline that collects competitor posts, rewrites them with an AI model, and auto-publishes

USE CASE 4

Monitor keywords on Xiaohongshu to track trending topics and surface insights for your niche

Tech stack

JavaScriptPythonNode.jsDocker

Getting it running

Difficulty · moderate Time to first run · 30min

Requires Python 3.10+, Node.js 20+, and manual browser cookie extraction for authentication, Docker deployment available as an easier alternative.

Intended for learning and research only, commercial use is explicitly not permitted by the project author.

In plain English

Spider_XHS is a data collection and content publishing tool for Xiaohongshu, a Chinese social media platform sometimes called Little Red Book or RedNote. Xiaohongshu does not provide a public API for most of its features, so this project works by reverse-engineering the platform's internal web requests, including the authentication signatures the platform uses to verify that requests come from its own app. The README is written in Chinese. The tool covers three main areas. The first is data collection from the regular Xiaohongshu website: logging in, searching posts, reading post content and comments, downloading images and videos without watermarks, and fetching user profile information. The second is publishing through the creator platform: uploading image sets and videos, and managing published content. The third is data from Xiaohongshu's influencer marketplace, where brands find creators to work with, including follower demographics and historical trend data. The project is designed to work as a foundation for AI-powered content operations workflows. The README includes code examples showing how you would connect it to an AI model: collect posts from a competitor account, pass the content to an AI for rewriting, then automatically publish the result to your own creator account. Other example workflows include keyword monitoring for trend analysis and influencer selection using AI scoring. Setup requires Python 3.10 or higher and Node.js 20 or higher. Authentication is done by copying your browser cookies after logging in manually and placing them in a configuration file. Docker deployment is also supported. The project includes a companion repository called XHS_ALL_IN_ONE with a graphical interface for account management and publishing. The README notes the project is intended for learning and research only, not commercial use.

Copy-paste prompts

Prompt 1
Using spider_xhs, how do I collect the last 30 posts from a specific Xiaohongshu account and export their titles, descriptions, and like counts to a CSV?
Prompt 2
Help me write a Python script with spider_xhs that: (1) searches a keyword on XHS, (2) collects the top 10 posts, (3) sends each post to an AI for rewriting, and (4) publishes the result to my creator account.
Prompt 3
I want to use spider_xhs to download videos from Xiaohongshu without watermarks. Walk me through getting my browser cookies and configuring the auth file.
Prompt 4
Using the spider_xhs influencer marketplace module, write code to fetch follower demographics for a list of creator usernames and rank them by engagement rate.
Prompt 5
How do I deploy spider_xhs with Docker so I can run it on a server without setting up Python and Node.js manually?
Open on GitHub → Explain another repo

← cv-cat on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.