explaingit

owid/covid-19-data

5,664PythonAudience · researcherComplexity · 1/5Setup · easy

TLDR

Archived dataset of COVID-19 cases, deaths, hospitalizations, testing, and vaccination figures for every country worldwide, collected daily by Our World in Data from 2020 through August 2024.

Mindmap

mindmap
  root((COVID-19 Data))
    Metrics tracked
      Cases and deaths
      Hospitalizations
      Testing rates
      Vaccinations
    Coverage
      Every country
      Daily granularity
      2020 to 2024
    Files
      Ready-to-use CSVs
      Processing scripts
    Status
      Archived Aug 2024
      Read-only snapshot
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Download ready-to-use CSV files of historical COVID-19 case, death, and vaccination data for any country for academic research

USE CASE 2

Study the data pipeline scripts to understand how Our World in Data collected and harmonized pandemic statistics from dozens of national sources

USE CASE 3

Use the complete historical snapshot for retrospective analysis of pandemic trends across countries through August 2024

Tech stack

PythonCSV

Getting it running

Difficulty · easy Time to first run · 5min

Repository is archived with no new data after August 2024, use the linked live OWID data catalog for current figures.

In plain English

This repository was the main home for Our World in Data's COVID-19 dataset, which tracked cases, deaths, hospitalisations, testing, and vaccinations across every country in the world. The data was updated daily throughout the pandemic and was widely used by researchers, journalists, and governments. The repository is now archived: the last update was in August 2024, and no new data will be added here. The project is split into two parts. The first is a folder of final, ready-to-use data files in CSV and other formats, intended for anyone who simply wants the numbers. The second is the full pipeline of scripts that produced those files, which is for people who want to understand how the data was collected and processed, or who want to contribute to the methodology. Because this repository is no longer maintained, the README directs people who want current COVID-19 figures to a separate, live data catalog that Our World in Data continues to update. That resource also provides documentation about what each metric means and how to access any of their other datasets programmatically. For anyone who used this repository for research or automated data pipelines, the archived snapshot still contains a complete historical record up to the cutoff date. The data methodology and variable definitions are documented both inside the repository and in external technical documentation linked from the README.

Copy-paste prompts

Prompt 1
Load the owid/covid-19-data main CSV into pandas and plot the vaccination rate over time for three countries side by side
Prompt 2
What columns are available in the Our World in Data COVID-19 CSV and what does each variable represent, give me a plain-English column guide
Prompt 3
Show me how to use this dataset to calculate and compare the case fatality rate by country during the first wave of 2020
Prompt 4
This repository is archived, where do I find the current Our World in Data COVID dataset and how do I access it programmatically to continue an analysis?
Prompt 5
Using the owid COVID-19 CSV, write a Python script that finds the 10 countries with the highest peak daily deaths per million population
Open on GitHub → Explain another repo

← owid on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.