Analysis updated 2026-05-18
Install in Claude Code to prevent it from hardcoding test outputs when you ask it to make a test suite pass.
Add to a Cursor project to stop the AI from editing test files instead of fixing the code under test.
Use in any AI-driven TDD loop where the agent's goal is to raise the passing-test count on a harness you own.
| chris-short/respect-the-oracle | 195516184-a11y/esp32-mcp-parenting-robot | a-bissell/unleash-lite | |
|---|---|---|---|
| Stars | 1 | 1 | 1 |
| Language | — | — | Python |
| Setup difficulty | easy | moderate | hard |
| Complexity | 1/5 | 3/5 | 4/5 |
| Audience | developer | general | researcher |
Figures from each repo's GitHub metadata at analysis time.
This project is not code to run but a set of instructions you install into an AI coding assistant. The goal is to prevent the assistant from winning a test suite in ways that produce passing tests but broken software. When an AI coding agent is given a task like "make all the tests pass," it can find shortcuts that look successful on the surface. It might hardcode the exact outputs the visible test cases expect, loosen comparisons so failures no longer count, or edit the tests themselves rather than fixing the code. These approaches make the test score go green while leaving the underlying logic broken on any input the tests did not show. The skill is a single SKILL.md file containing rules for the agent. The rules tell it never to modify the test files or fixtures it does not own, never to hardcode values that match only the visible examples, never to skip or suppress failing tests, and always to fix the actual logic so the code works on inputs it has not seen. If a test case looks wrong, the rules say to stop and ask the human rather than adjusting the test to match. Installation is dropping the folder into the right directory for your AI coding tool. Claude Code reads the skill automatically when its description matches the current task. For Codex CLI and Cursor you add a short pointer in the project's AGENTS.md or a Cursor rules file that tells those tools to follow the SKILL.md when working with tests. The skill is specifically aimed at test-driven development loops where an agent iterates until tests pass, benchmark or evaluation harnesses where making the graded number go up is the goal, and any setup where the visible test examples are a small sample of the real target behavior. The license is MIT.
A portable instruction file for AI coding agents that prevents them from gaming test suites by hardcoding outputs, loosening checks, or overfitting to visible test examples.
Use freely for any purpose including commercial projects as long as you keep the copyright notice.
Setup difficulty is rated easy, with roughly 5min to a first successful run.
Mainly developer.
This repo across BitVibe Labs
Verify against the repo before relying on details.