Analysis updated 2026-06-20
Find the best free online courses to start learning data science from scratch, organized by skill level and topic.
Discover Python or R packages for a specific task, like model evaluation or data visualization, without having to search the whole internet.
Find data science competitions to practice on, communities to join, and newsletters to follow to stay current with the field.
| academic/awesome-datascience | wesbos/javascript30 | heroui-inc/heroui | |
|---|---|---|---|
| Stars | 29,134 | 29,114 | 29,112 |
| Language | — | HTML | TypeScript |
| Setup difficulty | easy | easy | easy |
| Complexity | 1/5 | 1/5 | 2/5 |
| Audience | data | developer | developer |
Figures from each repo's GitHub metadata at analysis time.
No code or installation required, it is a reference list, just read and follow the links.
Awesome Data Science is a curated reference list for anyone trying to learn data science or find tools and resources for applying it to real-world problems. Data science is the practice of extracting useful insights from large collections of information using a combination of statistics, programming, and domain knowledge. The repository does not contain code to run. Instead, it is a structured collection of links organized into categories. These include explanations of what data science is, guides on where to start, free and paid online courses, university programs, and a toolbox covering algorithms and packages. The algorithms section groups techniques by learning type: supervised learning (where you train a model on labeled examples), unsupervised learning (where the model finds patterns without labels), semi-supervised learning, reinforcement learning (where an agent learns by trial and reward), and deep learning architectures (neural network designs). The toolbox also covers packages for machine learning, model evaluation, visualization, and miscellaneous utilities. Beyond tools, the list links to books, journals, newsletters, blogs, podcasts, YouTube channels, Slack communities, and data science competitions. Someone would use this repository as a starting point when they want to get into data science but do not know which courses, books, or tools to begin with, or when they are already practicing and want to discover new resources, communities, or datasets they have not encountered before.
A curated mega-list of data science learning resources, courses, books, algorithms, tools, communities, and competitions, organized so beginners know where to start and practitioners discover what they've missed.
Not specified in the explanation.
Setup difficulty is rated easy, with roughly 5min to a first successful run.
Mainly data.
This repo across BitVibe Labs
Verify against the repo before relying on details.