bramblexu/pydata-notebook

Analysis updated 2026-07-03

★ 4,664Jupyter NotebookAudience · dataComplexity · 1/5LicenseSetup · easy

Mindmap

mindmap
  root((repo))
    What it is
      Book translation
      Chinese language
      Jupyter Notebooks
    Topics
      NumPy arrays
      pandas tables
      Data cleaning
      Time series
    Datasets used
      Movie ratings
      US baby names
    License
      MIT for code
      Python 3

mindmap root((repo)) What it is Book translation Chinese language Jupyter Notebooks Topics NumPy arrays pandas tables Data cleaning Time series Datasets used Movie ratings US baby names License MIT for code Python 3

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Follow along with Python for Data Analysis in Chinese using interactive notebooks you can run and modify as you read.

USE CASE 2

Learn pandas and NumPy fundamentals by executing real code examples on datasets like movie ratings and US baby names.

USE CASE 3

Use the time series and data cleaning chapters as a hands-on reference when working on your own data projects.

What is it built with?

PythonJupyter NotebookNumPypandas

How does it compare?

	bramblexu/pydata-notebook	dusty-nv/jetson-containers	makcedward/nlpaug
Stars	4,664	4,660	4,657
Language	Jupyter Notebook	Jupyter Notebook	Jupyter Notebook
Setup difficulty	easy	moderate	easy
Complexity	1/5	3/5	2/5
Audience	data	researcher	data

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 5min

Requires Python 3 with pandas and NumPy installed, notebooks run locally in Jupyter or JupyterLab.

Code samples are released under MIT, use freely for any purpose, including commercial projects, as long as you keep the copyright notice.

In plain English

This repository contains a partial Chinese translation of the book "Python for Data Analysis, Second Edition" (2017), written by Wes McKinney, who created the pandas data analysis library. The translation is presented as Jupyter Notebooks, which are interactive documents that mix explanatory text with runnable Python code, making them useful for reading and experimenting at the same time. The translator worked through selected chapters of the book and rendered them in Chinese for readers who find the English original difficult to follow. The chapters included cover the core tools used in data analysis with Python: NumPy for working with arrays and numerical computation, pandas for handling structured data tables, data cleaning and preparation techniques, time series analysis, advanced pandas features, and several worked examples using real datasets such as movie ratings and US baby name records. The translation is not word-for-word. The translator chose to paraphrase and adapt the content for clarity, and acknowledges that some errors or awkward passages may exist since this was a solo effort. The translator also notes that the full translation is kept partial out of respect for copyright, because the book's Chinese translation rights belong to a publisher and were not granted to this project. The original author of the book was contacted and confirmed there are no copyright concerns with this notebook-format study companion. All code samples in the notebooks are released under the MIT license. The project uses Python 3, updated from the first edition which used Python 2. This is primarily a study resource for Chinese-speaking learners who want to follow along with the book using interactive notebooks rather than static text.

Copy-paste prompts

Prompt 1

Using the pandas techniques from bramblexu/pydata-notebook, load a CSV file of sales records and compute the monthly total revenue grouped by product category.

Prompt 2

Show me how to clean a dataset with missing values using the methods covered in pydata-notebook's data preparation chapter.

Prompt 3

Using the NumPy section from pydata-notebook, write code to create a 2D array of random numbers and compute row-wise and column-wise means.

Prompt 4

Apply the time series analysis techniques from pydata-notebook to a dataset of daily stock prices: resample to weekly averages and plot the result.

Prompt 5

Using the movie ratings dataset example from pydata-notebook, find the top 10 highest-rated movies that have at least 100 reviews.

Frequently asked questions

What is pydata-notebook?

A partial Chinese translation of "Python for Data Analysis" rendered as interactive Jupyter Notebooks, covering NumPy, pandas, data cleaning, and time series with runnable Python 3 code examples.

What language is pydata-notebook written in?

Mainly Jupyter Notebook. The stack also includes Python, Jupyter Notebook, NumPy.

What license does pydata-notebook use?

Code samples are released under MIT, use freely for any purpose, including commercial projects, as long as you keep the copyright notice.

How hard is pydata-notebook to set up?

Setup difficulty is rated easy, with roughly 5min to a first successful run.

Who is pydata-notebook for?

Mainly data.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub bramblexu on gitmyhub

Verify against the repo before relying on details.