explaingit

ajaysoni-dev/ai-ds-100

Analysis updated 2026-06-24

32Audience · dataComplexity · 2/5LicenseSetup · easy

TLDR

Practice lab bundling 26 small data science Jupyter notebooks, each shipped with its dataset and a finished PDF, split into basic, intermediate, and advanced folders.

Mindmap

mindmap
  root((AI-DS-100))
    Inputs
      CSV datasets
      Jupyter notebooks
    Outputs
      Trained baseline model
      Evaluation metrics
      Reference PDF
    Use Cases
      Portfolio practice
      Resume projects
      Learning the data workflow
    Tech Stack
      Python
      Jupyter
      pandas
      scikit-learn
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Practice the load-clean-train-evaluate workflow on 26 different real datasets

USE CASE 2

Pick a beginner project like Titanic or wine quality and compare your notebook output to the included PDF

USE CASE 3

Extend an advanced project such as crop yield or warranty fraud with stronger models for a portfolio piece

What is it built with?

PythonJupyterpandasnumpyscikit-learnmatplotlib

How does it compare?

ajaysoni-dev/ai-ds-100autolearnmem/automemchungyuandye/ntou_thesis
Stars323232
LanguagePythonTeX
Setup difficultyeasyhardmoderate
Complexity2/55/52/5
Audiencedataresearcherwriter

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 30min

One pip install line covers the dependencies, but each project ships zipped so you unzip before opening the notebook.

MIT license, you can use, modify, and ship it as long as you keep the copyright notice.

In plain English

AI-DS-100 is a learning lab for people picking up AI and data science. It is a GitHub repository that bundles together 26 small practice projects, each one centred around a Jupyter notebook, the dataset that notebook uses, a PDF copy of the finished notebook, and a short description file. The repository name suggests an eventual goal of 100 projects, but the current version stops at 26. The 26 projects are split into three folders by difficulty. The basic folder has eight beginner projects such as Titanic survival, salary prediction, and red wine quality. The intermediate folder has twelve, covering things like customer churn, hotel booking cancellations, loan approval, and stroke risk. The advanced folder has six larger projects including used-car pricing in Belarus and India, crop yield forecasting, traffic-flow prediction, and warranty-claim fraud. Each project follows the same pattern, which is part of the point of the lab: load a dataset, look at the rows and missing values, clean and encode the data, plot a few charts, split into training and test sets, train a baseline model, score it with appropriate metrics, and write up the results. The notebooks rely on familiar Python libraries: pandas, numpy, matplotlib, and scikit-learn. To use the lab, you pick the difficulty folder that matches where you are, unzip a project, and open the notebook in Jupyter, JupyterLab, VS Code, Google Colab, or Kaggle. The README lists a single pip install line for the common dependencies. After running the notebook from top to bottom, you can compare your output with the PDF the author shipped, or extend it with extra evaluation or different models. The README is clear that the projects are deliberately kept simple and use baseline models, not production-grade pipelines. They are intended for portfolio practice, resume material, and getting used to applying the same workflow across different real-world datasets. The repository is released under the MIT licence.

Copy-paste prompts

Prompt 1
List the 26 projects in AI-DS-100 grouped by difficulty and tell me which one is best for a first attempt at classification
Prompt 2
Open the customer churn notebook in AI-DS-100 and walk me through the preprocessing it does on categorical features
Prompt 3
Help me swap the baseline model in the loan approval project for an XGBoost classifier and compare scores
Prompt 4
Suggest three extensions to the used-car pricing project that would make it portfolio-grade
Prompt 5
Write a requirements.txt for AI-DS-100 with pinned versions that match the notebooks

Frequently asked questions

What is ai-ds-100?

Practice lab bundling 26 small data science Jupyter notebooks, each shipped with its dataset and a finished PDF, split into basic, intermediate, and advanced folders.

What license does ai-ds-100 use?

MIT license, you can use, modify, and ship it as long as you keep the copyright notice.

How hard is ai-ds-100 to set up?

Setup difficulty is rated easy, with roughly 30min to a first successful run.

Who is ai-ds-100 for?

Mainly data.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Verify against the repo before relying on details.