explaingit

going-doer/paper2code

4,601PythonAudience · researcherComplexity · 3/5Setup · moderate

TLDR

An AI system that reads a machine learning research paper as a PDF or LaTeX file and automatically generates a working code repository implementing the paper's methods using a multi-agent pipeline.

Mindmap

mindmap
  root((repo))
    What it does
      Paper to code
      AI code generation
    Input formats
      PDF papers
      LaTeX source
    AI pipeline
      Planning agent
      Analysis agent
      Code generation agent
    Tech stack
      Python
      OpenAI API
      vLLM
    Evaluation
      Reference-free scoring
      HuggingFace benchmark
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Turn a machine learning research paper PDF into a runnable code repository automatically

USE CASE 2

Prototype an ML paper implementation without writing all the boilerplate from scratch

USE CASE 3

Run locally using open-source models via vLLM to generate paper code without an OpenAI API key

USE CASE 4

Evaluate how well AI-generated code matches a reference implementation using the included benchmark

Tech stack

PythonOpenAI APIvLLM

Getting it running

Difficulty · moderate Time to first run · 1h+

Requires an OpenAI API key (roughly $0.50-0.70 per paper with o3-mini) or a local vLLM setup with DeepSeek-Coder.

In plain English

Paper2Code is a research project from a team at ICLR 2026 that attempts to automatically turn machine learning research papers into working code repositories. The core system, called PaperCoder, takes a paper as input, either as a PDF or as LaTeX source files, and produces a folder of code that implements the methods described in the paper. The system works in three stages handled by multiple AI agents. A planning agent reads the paper and lays out what needs to be built. An analysis agent examines the technical details of the methods. A code generation agent then writes the actual code. The result is a structured output directory containing planning notes, analysis artifacts, and the final generated repository. To use it, you need an API key for an AI provider. The default supported option is OpenAI, where running the system on a single paper costs roughly fifty to seventy cents using the o3-mini model. The project also supports running open-source language models locally using a framework called vLLM, with DeepSeek-Coder as the default model for that path. Instructions in the README walk through the steps to convert a PDF into the JSON format the system expects, or you can feed it LaTeX directly. The README includes an example using the well-known "Attention Is All You Need" paper, which introduced the Transformer model. It also describes an evaluation framework for scoring how well the generated code matches a reference implementation, using either a reference-free approach (judged by the AI alone against the paper) or a reference-based approach (compared against the original authors' published code). The project also released a benchmark dataset on HuggingFace called paper2code, which pairs machine learning papers with their corresponding official code repositories for evaluation purposes.

Copy-paste prompts

Prompt 1
I have a machine learning paper PDF and want to use going-doer/paper2code to generate code from it. Walk me through converting the PDF and running PaperCoder end to end.
Prompt 2
Using paper2code with the o3-mini model, what does the planning phase produce and how does it decide what files and functions to generate?
Prompt 3
I want to run going-doer/paper2code with a local DeepSeek-Coder model via vLLM instead of the OpenAI API. Show me how to configure that.
Prompt 4
Help me use the paper2code evaluation framework to score how well the generated code matches the original authors' published implementation.
Open on GitHub → Explain another repo

← going-doer on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.