explaingit

lucidrains/disco-rl-pytorch

16PythonAudience · researcherComplexity · 4/5Setup · moderate

TLDR

A work-in-progress PyTorch implementation of DiscoRL, a method from a 2025 Nature paper by David Silver for automatically discovering which reinforcement learning algorithms perform best rather than relying on human-designed ones.

Mindmap

mindmap
  root((repo))
    What It Does
      DiscoRL algorithm
      Auto RL discovery
      Reference implementation
    Research Basis
      Nature 2025 paper
      David Silver DeepMind
      Test-time training link
    Tech Stack
      Python
      PyTorch
    Status
      Work in progress
      Minimal documentation
    Audience
      RL researchers
      AI paper readers
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Study the DiscoRL algorithm from the 2025 Nature paper by reading a clean PyTorch reference implementation.

USE CASE 2

Experiment with automated reinforcement learning algorithm discovery by running DiscoRL on your own environments.

Tech stack

PythonPyTorch

Getting it running

Difficulty · moderate Time to first run · 1h+

Work in progress with minimal documentation, requires reading the DiscoRL paper to understand the algorithm before using the code.

In plain English

This repository is a PyTorch implementation of DiscoRL, short for Discovering state-of-the-art reinforcement learning algorithms. The research it is based on was published in Nature in 2025 and represents the last work David Silver completed at DeepMind. Reinforcement learning is a field of AI where a system learns by trial and error, receiving rewards for good actions and penalties for bad ones, DiscoRL is a method for automatically discovering which learning algorithms perform best rather than relying on human-designed ones. The repository is marked as a work in progress, and the README is minimal: it contains a diagram, a brief description, and citation references for the underlying research paper and a related paper on test-time training. There is no setup guide, usage documentation, or code walkthrough provided at this stage. The project comes from lucidrains, a prolific open-source contributor known for implementing recent AI research papers in PyTorch as learning and reference resources.

Copy-paste prompts

Prompt 1
I am reading the DiscoRL Nature 2025 paper by David Silver. Walk me through how the disco-rl-pytorch implementation maps to the core algorithm described in the paper.
Prompt 2
I want to run disco-rl-pytorch on a custom Gym environment. What interface does my environment need to implement to plug in?
Prompt 3
The repository is marked as a work in progress. Which parts of the DiscoRL algorithm are already implemented and which are still missing or stubbed out?
Open on GitHub → Explain another repo

← lucidrains on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.