Analysis updated 2026-05-18
Run a Claude Code session where planning happens on Fable but file reads and boilerplate writing use cheaper models.
Get a cost report after each coding session showing actual spend versus what the same work would have cost on one model.
Dispatch coding tasks to OpenAI Codex or Google Gemini from inside a Claude Code session, using zero Anthropic tokens.
| | dangerousyams/muxer | chrisor-dev/claude-autosync | kyr0/ornith-35b-fp8-e4m3-mtp |
|---|---|---|---|
| Stars | 2 | 2 | 2 |
| Language | Shell | Shell | Shell |
| Setup difficulty | moderate | moderate | hard |
| Complexity | 3/5 | 3/5 | 5/5 |
| Audience | developer | developer | researcher |
Figures from each repo's GitHub metadata at analysis time.
Requires Claude Code with plugin support. The external Codex and Gemini delegates also need their respective CLIs installed and signed in separately.
Muxer is a plugin for Claude Code that routes different parts of a coding session to different AI models based on what the work actually requires. An expensive model like Fable handles planning and judgment while cheaper models handle execution: file exploration on Haiku, documentation and boilerplate on Sonnet, implementation on Opus. The orchestrating model stays in a lean main loop, reading only summaries and plans rather than raw files, which keeps its token cost low even at premium rates.
The routing happens in three places. First, each agent definition file includes a hard model assignment in its header, so a scout agent always runs on Haiku and a builder always runs on Opus regardless of what the main session uses. Second, a startup hook injects a short policy into every session telling the orchestrator to delegate rather than do everything itself. Third, a guard hook catches Claude Code's built-in subagents (like file exploration) before they inherit the session's expensive model, and pins them to Opus instead.
The plugin includes seven named agents: a scout for read-only exploration, a writer for low-risk edits and copy, a builder for implementation, a reviewer for adversarial verification, an oracle that escalates to Fable for hard decisions, and two external delegates that dispatch work to OpenAI Codex and Google Gemini through their command-line tools, consuming zero Anthropic tokens.
The quality rules are deliberately conservative: never hand a whole big task to a cheap model at once, always build one example at a verified tier before fanning out, and never use a cheaper model to review work that a more expensive model built. A task that fails review twice at one tier gets redone at the tier above.
After each session, a cost report compares actual spending against what the same work would have cost if run entirely on the main model. The comparison uses API list prices with a cache adjustment. The license is not mentioned in the README.
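The cost comparison described above can be illustrated with a minimal sketch. This is not the plugin's code (the repo is mainly Shell). The tier names, the per-million-token prices, the cache discount factor, and the `Usage`, `cost`, and `report` names are all hypothetical, chosen only to show the arithmetic of pricing the same token counts twice: once on the model that actually ran them, and once on the main model.

```python
from dataclasses import dataclass

# HYPOTHETICAL list prices in USD per million tokens. These are made-up
# numbers for illustration, not real prices for any model.
PRICES = {
    "premium": {"input": 20.0, "output": 100.0},
    "mid": {"input": 4.0, "output": 20.0},
    "cheap": {"input": 1.0, "output": 4.0},
}

# ASSUMPTION: cached input tokens are billed at 10% of the input rate.
CACHE_READ_FACTOR = 0.1


@dataclass
class Usage:
    model: str          # tier that actually handled the work
    input_tokens: int   # uncached input tokens
    cached_tokens: int  # input tokens served from cache
    output_tokens: int


def cost(usage, model=None):
    """Price one usage record on `model` (defaults to the model that ran it)."""
    price = PRICES[model or usage.model]
    fresh = usage.input_tokens * price["input"]
    cached = usage.cached_tokens * price["input"] * CACHE_READ_FACTOR
    output = usage.output_tokens * price["output"]
    return (fresh + cached + output) / 1_000_000


def report(usages, main_model):
    """Compare actual spend with the same tokens priced on the main model."""
    actual = sum(cost(u) for u in usages)
    baseline = sum(cost(u, main_model) for u in usages)
    return {"actual": actual, "baseline": baseline, "saved": baseline - actual}


if __name__ == "__main__":
    session = [
        Usage("premium", 40_000, 10_000, 5_000),   # planning
        Usage("cheap", 300_000, 200_000, 20_000),  # file exploration
    ]
    print(report(session, main_model="premium"))
```

The baseline reuses the recorded token counts unchanged, which assumes the main model would have consumed the same number of tokens for the same work. A real report may need to relax that assumption.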
A Claude Code plugin that routes coding tasks to different AI models by cost and capability, keeping an expensive orchestrator in a lean planning loop while cheaper models do the execution work.
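Two of the quality rules described above (redo a task one tier up after two failed reviews, and never review with a cheaper model than the one that built the work) can be sketched as follows. This is an illustration under assumptions, not the plugin's implementation. The tier ordering is inferred from the description, and `build`, `review`, `can_review`, and `run_with_escalation` are hypothetical names.

```python
# ASSUMPTION: tiers ordered from cheapest to most expensive, inferred from
# the description rather than taken from the repo.
TIERS = ["haiku", "sonnet", "opus", "fable"]

# From the rule "a task that fails review twice at one tier gets redone
# at the tier above".
MAX_FAILED_REVIEWS = 2


def can_review(reviewer_tier, builder_tier):
    """A reviewer must not be cheaper than the model that built the work."""
    return TIERS.index(reviewer_tier) >= TIERS.index(builder_tier)


def run_with_escalation(task, start_tier, build, review):
    """Build at start_tier; after two failed reviews, redo one tier up.

    `build(task, tier)` and `review(result)` are caller-supplied callables
    (hypothetical stand-ins for dispatching to agents).
    Returns (tier_that_passed, result).
    """
    tier = start_tier
    failures = 0
    while True:
        result = build(task, tier)
        if review(result):
            return tier, result
        failures += 1
        if failures >= MAX_FAILED_REVIEWS:
            if tier == TIERS[-1]:
                raise RuntimeError("task failed review at the top tier")
            tier = TIERS[TIERS.index(tier) + 1]
            failures = 0


if __name__ == "__main__":
    # Toy stand-ins: the "review" passes only for work built on opus.
    tier, result = run_with_escalation(
        "add retry logic",
        "sonnet",
        build=lambda task, tier: {"task": task, "tier": tier},
        review=lambda result: result["tier"] == "opus",
    )
    print(tier, result)
```

Resetting the failure counter on each escalation gives every tier its own two attempts. Whether the plugin counts failures per tier or per task is not stated in the description.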
Mainly Shell. The stack also includes Claude Code and the Claude API.
License not stated in the README.
Setup difficulty is rated moderate, with roughly 30 minutes to a first successful run.
Mainly developers.
Verify against the repo before relying on details.