explaingit

onlyterp/hermes-caduceus

16PythonAudience · developerComplexity · 3/5Setup · easy

TLDR

An optional add-on for the Hermes desktop AI agent app that adds structured step-by-step planning, a parallel workflow engine with a live visual panel, and automatic routing of sub-tasks to the cheapest capable AI model.

Mindmap

mindmap
  root((repo))
    What It Does
      Structured planning mode
      Parallel workflow engine
      Automatic model routing
    Features
      Caduceus step-by-step plan
      Loom workflow authoring
      Orchestration Theater UI
      Auto Router cost savings
    Tech Stack
      Python 3.11+
      Standard library only
    Use Cases
      Complex multi-step tasks
      Cost-optimized AI runs
    Setup
      Single install script
      Fully reversible
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Enable Caduceus mode in Hermes so the agent maintains a visible to-do plan and verifies each step before moving to the next.

USE CASE 2

Author a Hermes workflow that fans out sub-tasks to multiple agents in parallel and watch live progress in the Orchestration Theater panel.

USE CASE 3

Configure multiple AI models in Hermes and let the Auto Router assign each sub-task to the cheapest model that meets the capability bar.

USE CASE 4

Install hermes-caduceus with one script that backs up every file it touches and undo the whole thing with a single uninstall flag.

Tech stack

Python

Getting it running

Difficulty · easy Time to first run · 5min

Requires an existing Hermes desktop app installation, Python 3.11+ with standard library only, no extra packages needed.

In plain English

Hermes-caduceus is an optional add-on mode for Hermes, a desktop AI agent app made by Nous Research. If you already use Hermes, this project lets you turn on a more advanced planning style with a single command. When turned off, the fork behaves exactly like the original Hermes, so there is no penalty for installing it. The core addition is called Caduceus mode, which you activate by typing "/caduceus on" inside Hermes. Once on, the agent works through tasks by maintaining a visible to-do plan, completing one step at a time, and checking its own work before marking anything as done. For simple or quick requests, it skips the extra ceremony and responds normally, so the overhead only appears when a task genuinely calls for it. For more complex jobs, Caduceus includes a workflow engine called the Loom. Rather than running everything in one conversation thread, the agent can author a small Python workflow and run it across multiple sub-agents in parallel or in sequence. The Hermes desktop app shows this process in real time through what the project calls the Orchestration Theater: a visual panel with live phase lanes, per-agent status, token usage, and a shared budget counter. You watch the fan-out happen rather than waiting in silence. A third feature, the Auto Router, handles model selection when multiple AI models are configured. It scores each sub-task by capability requirements and sends that task to the cheapest model that meets the bar. The main orchestrator always keeps the model you chose for your session, so only the background workers get re-routed. Installation is a single git clone and one Python script, which auto-detects your Hermes install, backs up every file it touches, and can be fully undone with an uninstall flag. The project requires Python 3.11 or newer and uses only the standard library, with no additional packages to install. A local GPU worker option also exists for running workflow sub-agents on local models rather than cloud endpoints.

Copy-paste prompts

Prompt 1
I have Hermes installed and want to try Caduceus mode. What do I type to enable it and what changes in how the agent responds to complex tasks?
Prompt 2
I want to use the Loom workflow engine to process a list of 20 files in parallel using sub-agents. Show me how to author a Loom workflow for that.
Prompt 3
The Auto Router sends sub-tasks to cheaper models. How do I configure which models are available for routing and set a shared token budget limit?
Prompt 4
I need to fully uninstall hermes-caduceus and restore Hermes to its original state. What command do I run and which files does it restore?
Open on GitHub → Explain another repo

← onlyterp on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.