Work through interactive notebooks while reading the Python for Data Analysis book to learn data manipulation with pandas and NumPy.
Practice loading, cleaning, and reshaping real-world datasets using runnable code examples.
Create visualizations and analyze time series data by experimenting with the provided matplotlib and pandas examples.
Reference working code snippets for common data analysis tasks like joining tables and handling missing values.
This repository contains the companion code and Jupyter notebooks for the book "Python for Data Analysis, 3rd Edition" by Wes McKinney, published by O'Reilly Media. McKinney is the creator of pandas, a widely used Python library for working with structured data.

The notebooks cover data analysis in Python from the ground up. Topics include Python language basics, NumPy arrays (NumPy is a library for numerical computing), loading and cleaning real-world datasets, reshaping and joining data tables, creating visualizations, analyzing time series data, and an introduction to modeling. Each chapter of the book has a corresponding interactive notebook in which you can run and experiment with the code.

Use this repository as a hands-on companion while reading the book, or as a free reference for learning data analysis in Python. The book's content is also freely available on the author's website.

The tech stack is Python, with Jupyter notebooks as the interactive environment and libraries including pandas, NumPy, and matplotlib.
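To give a flavor of the tasks mentioned above (joining tables and handling missing values), here is a minimal pandas sketch. It is not taken from the book or the repository: the `orders` and `customers` tables, their column names, and the choice to fill gaps with the column mean are all hypothetical, chosen only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data, not taken from the book or the repository.
orders = pd.DataFrame(
    {
        "customer_id": [1, 2, 2, 3],
        "amount": [20.0, np.nan, 15.5, 42.0],
    }
)
customers = pd.DataFrame(
    {
        "customer_id": [1, 2, 4],
        "name": ["Ada", "Grace", "Linus"],
    }
)

# A left join keeps every order, even when no matching customer exists.
merged = orders.merge(customers, on="customer_id", how="left")

# Fill missing amounts with the column mean (one possible strategy among
# many) and label orders whose customer was not found.
merged["amount"] = merged["amount"].fillna(merged["amount"].mean())
merged["name"] = merged["name"].fillna("unknown")

# Total spend per customer name.
totals = merged.groupby("name")["amount"].sum()
print(totals)
```

Whether to fill, drop, or flag missing values depends on the dataset; mean imputation is used here only because it is short to show.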
Generated 2026-05-18 · Model: sonnet-4-6 · Verify against the repo before relying on details.