mrcube3/boss

Analysis updated 2026-06-24

★ 0PythonAudience · developerComplexity · 4/5Setup · hard

Mindmap

mindmap
  root((boss))
    Inputs
      Voice command
      Chart image
      API keys
    Outputs
      Kraken order
      Regime label
      Consensus signal
    Use Cases
      Voice-driven trading demo
      LLM ensemble experiments
      Multimodal pipeline study
    Tech Stack
      FastAPI
      React
      Vite
      Docker

mindmap root((boss)) Inputs Voice command Chart image API keys Outputs Kraken order Regime label Consensus signal Use Cases Voice-driven trading demo LLM ensemble experiments Multimodal pipeline study Tech Stack FastAPI React Vite Docker

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Run a local voice-to-trade demo against the Kraken sandbox

USE CASE 2

Experiment with a 3-LLM voting ensemble for trade signals

USE CASE 3

Build a multimodal pipeline mixing audio, vision, and text models

USE CASE 4

Test Speechmatics and Gemini Vision in one FastAPI service

What is it built with?

PythonFastAPIReactViteDocker

How does it compare?

	mrcube3/boss	0xhassaan/nn-from-scratch	a-little-hoof/dsr
Stars	0	0	0
Language	Python	Python	Python
Setup difficulty	hard	moderate	hard
Complexity	4/5	4/5	5/5
Audience	developer	developer	researcher

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1h+

Needs three paid API keys (Speechmatics, Gemini, Featherless) plus Kraken CLI to run anything beyond the sandbox fallback.

In plain English

This repository hosts a project the README calls llablab. It is described as a voice first multimodal trading engine. The idea is that a user speaks a command, the system reads a chart image they provide, multiple language models vote on what to do, and an order is sent to a crypto exchange. The README lays out four steps in the pipeline. First, an audio file is sent to Speechmatics, which transcribes the voice command into text. Second, a chart image is sent to Gemini Vision, which classifies the market regime as bullish, bearish, or range bound. Third, three language models accessed through a service called Featherless act as an ensemble and vote on a signal of buy, sell, or hold. Fourth, the Kraken command line tool is used to place the resulting order. The code is split into a backend and a frontend. The backend is a FastAPI service in Python 3.11 or newer that exposes endpoints for health checks, transcription, chart analysis, and running the full pipeline. The frontend is a React 19 app built with Vite 6, described as having a dark and light mode with a glassy iOS style interface. Running the project means starting the FastAPI server on port 8000 and the Vite dev server on port 3000, which proxies API calls to the backend. The README documents a small set of keyboard shortcuts in the UI, including keys for uploading a voice file, uploading a chart image, running the pipeline, resetting, toggling a help guide, and toggling dark mode. API keys for Speechmatics, Featherless, and Gemini are read from an .env file, and the README notes that services without valid keys fall back to a sandbox simulation rather than failing outright. Finally, the repo includes a Dockerfile. A user can build an image tagged voxregime-oracle and run it with the .env file mounted, exposing port 8501. The README does not go into detail about safety controls, position sizing, or risk limits on the trading side beyond describing the pipeline itself.

Copy-paste prompts

Prompt 1

Walk me through wiring my own Speechmatics and Gemini keys into the boss .env and starting both the FastAPI server and the Vite frontend

Prompt 2

Show me how to swap the Featherless ensemble in boss for a single local Ollama model and keep the buy/sell/hold output shape

Prompt 3

Explain how the /execute-pipeline endpoint in boss combines the transcript and chart regime into a Kraken order

Prompt 4

Help me add basic position sizing and a max-loss guard before the Kraken CLI step in boss

Prompt 5

Give me a curl example that hits /transcribe and /analyze-chart on boss and prints the consensus

Frequently asked questions

What is boss?

A voice-first trading pipeline that transcribes a spoken command, classifies a chart image, has three LLMs vote on buy/sell/hold, and fires the order through the Kraken CLI.

What language is boss written in?

Mainly Python. The stack also includes Python, FastAPI, React.

How hard is boss to set up?

Setup difficulty is rated hard, with roughly 1h+ to a first successful run.

Who is boss for?

Mainly developer.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Verify against the repo before relying on details.