kingbootoshi/rgr

★ 22TypeScriptAudience · developerComplexity · 2/5Setup · easy

Mindmap

mindmap
  root((rgr))
    What It Does
      Locks test snapshots
      Verifies test integrity
      Logs all events
    Tech Stack
      TypeScript
      Bun runtime
    Use Cases
      AI agent discipline
      Test-driven dev
    Workflow
      Red failing test
      Green implementation
      Refactor safely

mindmap root((rgr)) What It Does Locks test snapshots Verifies test integrity Logs all events Tech Stack TypeScript Bun runtime Use Cases AI agent discipline Test-driven dev Workflow Red failing test Green implementation Refactor safely

Click or tap to explore — scroll the page freely

Things people build with this

USE CASE 1

Enforce test-first discipline on AI coding agents like Claude Code or Codex so they cannot quietly change tests to make them pass.

USE CASE 2

Lock a failing test snapshot before writing any code and have RGR verify the test file was not altered during implementation.

USE CASE 3

Sign off on a task description with intent-contract before coding starts, then verify the final changes stayed within that original scope.

Tech stack

TypeScriptBun

Getting it running

Difficulty · easy Time to first run · 30min

Requires the Bun runtime, can be installed as a Claude Code or Codex plugin or run directly from a local clone.

License information is not described in the explanation.

In plain English

RGR is a command-line tool that enforces a specific coding discipline on AI coding agents. It follows the Red-Green-Refactor pattern, a practice where you write a failing test before writing any production code, prove the test actually fails, then write the implementation to make it pass, and only then clean up. RGR adds a layer of enforcement that prevents an AI agent from skipping or cheating these steps. The way it works: you initialize a goal, then run a command that records your failing test. RGR takes a snapshot of that test file and locks it with a hash. When you tell RGR the implementation is ready (the Green step), it checks that the test file has not changed since you recorded the Red state. If the agent quietly edited the test to make it easier to pass, RGR refuses to proceed. This makes it harder for an agent to produce results that look correct but are not actually verified. The tool also includes a companion plugin called intent-contract, which lets you sign a locked description of what a task is supposed to accomplish before any coding starts. Once locked, RGR can check that the final code changes stayed within the boundaries of that original description, not just that the tests pass. Together, the two tools are meant to let you accept an agent's work without having to trust the agent's honesty. RGR has no external dependencies and runs using Bun, a JavaScript runtime. It can be installed as a plugin for Claude Code or Codex, or cloned and run directly from a local checkout. Every run writes a log of events, snapshots, and diffs to a local directory so there is a record of what happened. The readme is thorough and includes installation steps, a full description of the workflow commands, examples of advanced replay modes, and a section on the threat model explaining what the tool can and cannot protect against.

Copy-paste prompts

Prompt 1

Set up rgr with Claude Code so that every coding task must start with a recorded failing test that I approve before the AI writes any implementation.

Prompt 2

I want to use rgr to catch an AI agent that is cheating my tests, walk me through the initialize, red, and green commands for a TypeScript project.

Prompt 3

Use intent-contract with rgr to lock a description of what a feature should do, then verify after coding that the changes matched that description.

Prompt 4

Help me configure rgr as a Codex plugin and walk through a full Red-Green-Refactor cycle for a new utility function.

Prompt 5

Show me how to use rgr's replay mode to review the snapshots and diffs from a previous coding session.

Open on GitHub → Explain another repo

← kingbootoshi on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.