Study the DiscoRL algorithm from the 2025 Nature paper by reading a clean PyTorch reference implementation.
Experiment with automated reinforcement learning algorithm discovery by running DiscoRL on your own environments.
Work in progress with minimal documentation; you will need to read the DiscoRL paper to understand the algorithm before using the code.
This repository is a PyTorch implementation of DiscoRL, short for Discovering state-of-the-art reinforcement learning algorithms. The research it is based on was published in Nature in 2025 and represents the last work David Silver completed at DeepMind. Reinforcement learning is a field of AI in which a system learns by trial and error, receiving rewards for good actions and penalties for bad ones. DiscoRL is a method for automatically discovering which learning algorithms perform best, rather than relying on human-designed ones. The repository is marked as a work in progress, and the README is minimal: it contains a diagram, a brief description, and citations for the underlying research paper and a related paper on test-time training. It provides no setup guide, usage documentation, or code walkthrough at this stage. The project comes from lucidrains, a prolific open-source contributor known for implementing recent AI research papers in PyTorch as learning and reference resources.
Verify against the repo before relying on details.