ddbourgin/numpy-ml

Analysis updated 2026-06-24

★ 16,340PythonAudience · researcherComplexity · 3/5Setup · easy

Mindmap

mindmap
  root((numpy-ml))
    Inputs
      NumPy arrays
      Training datasets
      Gym environments
    Outputs
      Trained models
      Predictions
      Reference implementations
    Use Cases
      Study ML algorithm internals
      Prototype a new model
      Teach a course
      Read clean RL code
    Tech Stack
      Python
      NumPy
      OpenAI Gym

mindmap root((numpy-ml)) Inputs NumPy arrays Training datasets Gym environments Outputs Trained models Predictions Reference implementations Use Cases Study ML algorithm internals Prototype a new model Teach a course Read clean RL code Tech Stack Python NumPy OpenAI Gym

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Read clean from-scratch implementations of LSTM, attention, and gradient boosting

USE CASE 2

Prototype and modify a Q-learning or Dyna-Q agent against an OpenAI Gym env

USE CASE 3

Use as a reference when implementing ML algorithms for a class or paper

What is it built with?

PythonNumPyOpenAI Gym

How does it compare?

	ddbourgin/numpy-ml	meta-llama/codellama	nvidia/megatron-lm
Stars	16,340	16,327	16,322
Language	Python	Python	Python
Setup difficulty	easy	hard	hard
Complexity	3/5	4/5	5/5
Audience	researcher	developer	researcher

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 30min

Reinforcement-learning modules need OpenAI Gym installed separately.

In plain English

numpy-ml is a Python library that implements a wide range of machine learning algorithms using only NumPy, a fundamental Python library for numerical computing. Most production ML tools hide all the math behind high-level abstractions, which is great for building things fast but makes it hard to understand what is actually happening. numpy-ml takes the opposite approach: the code is intentionally readable and "somewhat legible" (as the README itself puts it), not optimized for speed, making it a learning and experimentation resource. The library covers an impressive breadth of algorithms across many categories: neural network layers (including LSTM, attention, convolution, and normalization layers), classical machine learning models (decision trees, random forests, gradient boosting, linear and logistic regression), probabilistic models (Gaussian mixture models, hidden Markov models, Bayesian regression), reinforcement learning agents (Q-learning, Monte Carlo, Dyna-Q), and preprocessing utilities (text tokenization, Fourier transforms, feature encoding). You would use numpy-ml if you are studying how machine learning algorithms work under the hood, for a course, research, or to build intuition before using a larger production framework. It is also useful as a prototyping sandbox where you can experiment with and modify algorithms without fighting a complex codebase. It installs as a simple Python package via pip. The reinforcement learning models require the OpenAI gym environment, which can be installed alongside it.

Copy-paste prompts

Prompt 1

Walk me through the numpy-ml attention layer code and explain the math line by line

Prompt 2

Compare numpy-ml's gradient boosting implementation with scikit-learn's API

Prompt 3

Train a numpy-ml HMM on a small text dataset and show the forward-backward output

Prompt 4

Run the numpy-ml Q-learning agent on CartPole via OpenAI Gym and plot the reward curve

Frequently asked questions

What is numpy-ml?

A Python library of from-scratch ML algorithms written only in NumPy. Designed to be readable for learning rather than fast for production.

What language is numpy-ml written in?

Mainly Python. The stack also includes Python, NumPy, OpenAI Gym.

How hard is numpy-ml to set up?

Setup difficulty is rated easy, with roughly 30min to a first successful run.

Who is numpy-ml for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub ddbourgin on gitmyhub

Verify against the repo before relying on details.