Analysis updated 2026-06-24
Reproduce a FiveThirtyEight politics or sports analysis from raw data
Use a CSV as a teaching dataset for an intro stats or pandas class
Bootstrap a side project with vetted real-world journalism data
| fivethirtyeight/data | stefan-jansen/machine-learning-for-trading | ufund-me/qbot | |
|---|---|---|---|
| Stars | 17,359 | 17,322 | 17,322 |
| Language | Jupyter Notebook | Jupyter Notebook | Jupyter Notebook |
| Setup difficulty | easy | hard | hard |
| Complexity | 1/5 | 4/5 | 4/5 |
| Audience | data | data | data |
Figures from each repo's GitHub metadata at analysis time.
Just clone and open the CSVs in pandas or a spreadsheet, no install needed.
This repository is the public archive of data and code that powered the articles and charts published by FiveThirtyEight, a data journalism outlet. Each folder in the repository corresponds to a story or analysis, containing the raw data files and any code used to process or visualize them. The datasets cover topics FiveThirtyEight wrote about, including sports, politics, economics, and culture. An index file lists all available datasets alongside links to the accompanying articles. The data is released under the Creative Commons Attribution 4.0 license, meaning anyone can freely use and share it with attribution. The accompanying code is under the MIT License. Sports predictions and forecasts in the repository are no longer being updated as of June 2023. The rest of the data archive remains available as a historical record. You would use this if you are a student, journalist, or data analyst who wants to explore real-world datasets from published journalism, reproduce a FiveThirtyEight analysis, or use their data as a starting point for your own work.
Public archive of datasets and code behind FiveThirtyEight articles, covering politics, sports, economics, and culture. Each folder maps to one published story.
Mainly Jupyter Notebook. The stack also includes Jupyter, Python, R.
Data is free to use and share with attribution under CC BY 4.0, and the code is free to use under MIT.
Setup difficulty is rated easy, with roughly 5min to a first successful run.
Mainly data.
This repo across BitVibe Labs
Verify against the repo before relying on details.