explaingit

therealdk8890/dprovenancekitpython

Analysis updated 2026-05-18

1PythonAudience · developerComplexity · 3/5Setup · easy

TLDR

A Python library that records AI agent execution traces, diffs them against a golden baseline, and can block CI pipelines when an agent silently drops a step or adds a loop.

Mindmap

mindmap
  root((DProvenanceKit))
    Core workflow
      Record trace
      Save to SQLite
      Diff runs
      Gate in CI
    Anomaly detection
      Tool drop detection
      Looping detection
      Custom rule registry
    Integrations
      LangChain and LangGraph
      OpenAI Agents SDK
      LlamaIndex and CrewAI
      OpenTelemetry OTLP
    CI support
      GitHub Action
      GitLab CI template
      CLI gate command
    Outputs
      Trace diff report
      HTML shareable report
      PR diff comment
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Record an AI agent's execution trace as a SQLite file and diff it against a previous run to detect dropped steps.

USE CASE 2

Set up a GitHub Action that automatically blocks a PR when an agent regresses from its golden baseline.

USE CASE 3

Detect tool-drop or looping regressions in a LangChain or OpenAI Agents SDK workflow using built-in anomaly rules.

USE CASE 4

Add reasoning observability to a plain Python AI pipeline without installing any third-party dependencies.

What is it built with?

PythonSQLiteOpenTelemetryLangChainOpenAI Agents SDK

How does it compare?

therealdk8890/dprovenancekitpythona-bissell/unleash-liteabhiinnovates/whatsapp-hr-assistant
Stars111
LanguagePythonPythonPython
Setup difficultyeasyhardhard
Complexity3/54/53/5
Audiencedeveloperresearcherdeveloper

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 30min

In plain English

DProvenanceKit is a Python library that helps you catch when an AI agent quietly changes what it does between runs. When an agent drops a tool call, skips a verification step, or falls into a new loop, the change may never surface in the output text itself. This library records each run as a structured trace, lets you compare two runs side by side, and can block a pull request in your CI pipeline if the agent's behavior has drifted from a known-good baseline. The core workflow is: record a run, save it to a file, then diff later runs against that saved golden run. You wrap your existing code with a simple context manager. The trace file is an SQLite database stored on your machine with no external service required. When you run the same workflow again after making changes, the library compares the two traces and reports which steps were dropped, added, or reordered. It integrates with popular AI frameworks including LangChain, LangGraph, the OpenAI Agents SDK, LlamaIndex, and CrewAI through optional adapters. It also accepts traces from any OpenTelemetry-instrumented system. The core library has zero dependencies beyond the Python standard library, so adding it to an existing project is straightforward. For teams using automated pipelines, the library ships a command-line gate tool and a ready-made GitHub Action. These can fail a pull request automatically when a candidate run structurally diverges from the golden baseline, posting a diff comment to the PR. Two built-in anomaly rules detect the most common regressions: a tool call being dropped and an agent entering a loop where it was not looping before. The library is a Python port of an original Swift implementation. A separate hosted web dashboard for visualizing traces and managing multiple runs is available as a commercial service at dprovenance.dev, but the local open-source library works independently.

Copy-paste prompts

Prompt 1
I'm using DProvenanceKit with LangChain. Show me how to wrap my agent with the trace context manager, save the golden run, and diff a candidate run against it.
Prompt 2
I want to set up the DProvenanceKit GitHub Action to block PRs when my AI agent drops a step. Walk me through the Action configuration.
Prompt 3
My OpenAI Agents SDK agent is adding unexpected loops between runs. How do I use DProvenanceKit's looping anomaly rule to detect and fail CI on this?
Prompt 4
Show me how to use trace.explain() to print a human-readable summary of an agent run and trace.diff() to compare it with a saved golden baseline.
Prompt 5
I want to run dprovenancekit evaluate on my project. Explain what the Precision/Recall/F1 scores mean for regression detection in an AI agent.

Frequently asked questions

What is dprovenancekitpython?

A Python library that records AI agent execution traces, diffs them against a golden baseline, and can block CI pipelines when an agent silently drops a step or adds a loop.

What language is dprovenancekitpython written in?

Mainly Python. The stack also includes Python, SQLite, OpenTelemetry.

How hard is dprovenancekitpython to set up?

Setup difficulty is rated easy, with roughly 30min to a first successful run.

Who is dprovenancekitpython for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub therealdk8890 on gitmyhub

Verify against the repo before relying on details.