wdnmd1265/ai-flow-architect

Analysis updated 2026-06-24

★ 25PythonAudience · developerComplexity · 4/5LicenseSetup · moderate

Mindmap

mindmap
  root((AI Flow Architect))
    Inputs
      Task request
      API keys
      Project context
      Generated output
    Outputs
      Approved blueprint
      Audit verdict
      Findings list
      Evidence chain
    Pipeline
      Planner brain
      Opponent attacks
      Expert agents
      Arbiter brain
    Tech Stack
      Python
      OpenAI
      Anthropic
      Ollama
      DeepSeek

mindmap root((AI Flow Architect)) Inputs Task request API keys Project context Generated output Outputs Approved blueprint Audit verdict Findings list Evidence chain Pipeline Planner brain Opponent attacks Expert agents Arbiter brain Tech Stack Python OpenAI Anthropic Ollama DeepSeek

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Run an LLM design proposal through a security and cost critique before writing code

USE CASE 2

Drop TrustEngine into an existing app to audit AI-generated output with a pass review reject verdict

USE CASE 3

Pair OpenAI and Anthropic models so each one critiques the other's plan

USE CASE 4

Produce a SHA-256 evidence chain showing which model said what during an AI task

What is it built with?

PythonOpenAIAnthropicOllamaDeepSeek

How does it compare?

	wdnmd1265/ai-flow-architect	arlandaren/proagents	audietoffe/plasma-gpu-router
Stars	25	25	25
Language	Python	Python	Python
Setup difficulty	moderate	easy	hard
Complexity	4/5	1/5	3/5
Audience	developer	vibe coder	ops devops

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Needs Python 3.9+ and at least one LLM API key, cross-provider pairing is recommended but adds a second key and billing setup.

Apache 2.0 permits commercial use, modification, and distribution with attribution and an explicit patent grant.

In plain English

AI Flow Architect is a Python framework for running an AI task through more than one model on purpose, so the models can check each other's work. The tagline in the README is "AI proposes. You decide." The author opens by saying that a single language model has no way to catch its own blind spots, and gives the example of asking GPT-4 to design a login system and getting back code that uses MD5 for password hashing with no rate limiting. The project's response to that is to wire two independent AI brains, ideally from different providers, into a fixed pipeline that you approve at each step. The full framework, called FlowArchitect, runs a task in stages. Brain 1, the planner, takes your request and produces a step-by-step blueprint with risk notes. An "opponent brain" then attacks that blueprint from five different angles: security audit, cost, user empathy, data rigor, and minimalism. You review and approve the plan, after which an "expert team" of session-isolated agents (creative, evaluator, programmer, reviewer) carries out the work. Finally Brain 2, the arbiter, run on a different model, compares the finished output against the original blueprint line by line and produces a quality report. There is also a standalone piece called TrustEngine you can drop into an existing project without the rest of the framework. You call engine.audit with the original requirement, the AI-generated output, and a context object describing the project. You get back a verdict (pass, review, or reject), a confidence score, findings with severity, risk points, an "uncertainty" section where the engine states what it does not know, optional votes from several arbiters, and a SHA-256 hashed evidence chain. The workflow is fixed rather than free-form, which the author treats as a feature: every task follows the same gated pipeline. One API key is enough to start because Brain 2 will auto-pick a cheaper model from the same provider, but the README says cross-provider pairing, for example OpenAI plus Anthropic, gives the strongest checks because the two models have different training data and failure modes. OpenAI (gpt-4o family, gpt-4-turbo, gpt-3.5-turbo) and Anthropic (Claude 3.5 Sonnet, 3.5 Haiku, 3 Opus) are marked production-tested. DashScope, Zhipu GLM, Moonshot, DeepSeek, and local Ollama models are listed as community-ready through OpenAI-compatible protocol but needing user verification. The project is alpha, Python 3.9 or newer, Apache 2.0 licensed, with 177 tests passing.

Copy-paste prompts

Prompt 1

Set up FlowArchitect with one OpenAI key and one Anthropic key so Brain 1 is gpt-4o and Brain 2 is Claude 3.5 Sonnet

Prompt 2

Show me how to call TrustEngine.audit on a code snippet my agent just produced, passing a project context for a Django web app

Prompt 3

Configure ai-flow-architect to use a local Ollama model as Brain 2 via the OpenAI-compatible endpoint

Prompt 4

Write a Python script that pipes a user requirement through the planner, opponent, expert team, and arbiter stages and prints the final quality report

Prompt 5

Add a custom opponent angle to FlowArchitect that attacks the blueprint from an accessibility perspective

Frequently asked questions

What is ai-flow-architect?

Python framework that runs an AI task through a fixed multi-stage pipeline with two independent models cross-checking each other's work.

What language is ai-flow-architect written in?

Mainly Python. The stack also includes Python, OpenAI, Anthropic.

What license does ai-flow-architect use?

Apache 2.0 permits commercial use, modification, and distribution with attribution and an explicit patent grant.

How hard is ai-flow-architect to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is ai-flow-architect for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Verify against the repo before relying on details.