Analysis updated 2026-05-18
Work through interactive notebooks while reading the Python for Data Analysis book to learn data manipulation with pandas and NumPy.
Practice loading, cleaning, and reshaping real-world datasets using runnable code examples.
Create visualizations and analyze time series data by experimenting with the provided matplotlib and pandas examples.
Reference working code snippets for common data analysis tasks like joining tables and handling missing values.
| wesm/pydata-book | trekhleb/homemade-machine-learning | microsoft/omniparser | |
|---|---|---|---|
| Stars | 24,540 | 24,516 | 24,723 |
| Language | Jupyter Notebook | Jupyter Notebook | Jupyter Notebook |
| Setup difficulty | easy | easy | moderate |
| Complexity | 2/5 | 2/5 | 4/5 |
| Audience | general | developer | researcher |
Figures from each repo's GitHub metadata at analysis time.
This repository contains the companion code and Jupyter Notebooks for the book "Python for Data Analysis, 3rd Edition" by Wes McKinney, published by O'Reilly Media. Wes McKinney is the creator of pandas, the most widely used Python library for working with structured data. The notebooks cover data analysis from the ground up using Python. Topics include Python language basics, working with NumPy arrays (a library for numerical computing), loading and cleaning real-world datasets, reshaping and joining data tables, creating visualizations, analyzing time series data, and an introduction to modeling. Each chapter of the book has a corresponding interactive notebook where you can run and experiment with the code. You would use this repository as a hands-on companion while reading the book, or as a free reference for learning data analysis in Python. The book content itself is also freely available on the author's website. The tech stack is Python, with Jupyter Notebooks as the interactive environment, and libraries including pandas, NumPy, and matplotlib.
Interactive Jupyter Notebooks and code examples for learning data analysis in Python, covering NumPy, pandas, visualization, and time series.
Mainly Jupyter Notebook. The stack also includes Python, Jupyter Notebook, pandas.
License could not be detected automatically. Check the repository's LICENSE file before use.
Setup difficulty is rated easy, with roughly 5min to a first successful run.
Mainly general.
This repo across BitVibe Labs
Verify against the repo before relying on details.