explaingit

mrcube3/boss

0PythonAudience · developerComplexity · 4/5ActiveSetup · hard

TLDR

A voice-first trading pipeline that transcribes a spoken command, classifies a chart image, has three LLMs vote on buy/sell/hold, and fires the order through the Kraken CLI.

Mindmap

mindmap
  root((boss))
    Inputs
      Voice command
      Chart image
      API keys
    Outputs
      Kraken order
      Regime label
      Consensus signal
    Use Cases
      Voice-driven trading demo
      LLM ensemble experiments
      Multimodal pipeline study
    Tech Stack
      FastAPI
      React
      Vite
      Docker

Things people build with this

USE CASE 1

Run a local voice-to-trade demo against the Kraken sandbox

USE CASE 2

Experiment with a 3-LLM voting ensemble for trade signals

USE CASE 3

Build a multimodal pipeline mixing audio, vision, and text models

USE CASE 4

Test Speechmatics and Gemini Vision in one FastAPI service

Tech stack

PythonFastAPIReactViteDocker

Getting it running

Difficulty · hard Time to first run · 1h+

Needs three paid API keys (Speechmatics, Gemini, Featherless) plus Kraken CLI to run anything beyond the sandbox fallback.

In plain English

This repository hosts a project the README calls llablab. It is described as a voice first multimodal trading engine. The idea is that a user speaks a command, the system reads a chart image they provide, multiple language models vote on what to do, and an order is sent to a crypto exchange. The README lays out four steps in the pipeline. First, an audio file is sent to Speechmatics, which transcribes the voice command into text. Second, a chart image is sent to Gemini Vision, which classifies the market regime as bullish, bearish, or range bound. Third, three language models accessed through a service called Featherless act as an ensemble and vote on a signal of buy, sell, or hold. Fourth, the Kraken command line tool is used to place the resulting order. The code is split into a backend and a frontend. The backend is a FastAPI service in Python 3.11 or newer that exposes endpoints for health checks, transcription, chart analysis, and running the full pipeline. The frontend is a React 19 app built with Vite 6, described as having a dark and light mode with a glassy iOS style interface. Running the project means starting the FastAPI server on port 8000 and the Vite dev server on port 3000, which proxies API calls to the backend. The README documents a small set of keyboard shortcuts in the UI, including keys for uploading a voice file, uploading a chart image, running the pipeline, resetting, toggling a help guide, and toggling dark mode. API keys for Speechmatics, Featherless, and Gemini are read from an .env file, and the README notes that services without valid keys fall back to a sandbox simulation rather than failing outright. Finally, the repo includes a Dockerfile. A user can build an image tagged voxregime-oracle and run it with the .env file mounted, exposing port 8501. The README does not go into detail about safety controls, position sizing, or risk limits on the trading side beyond describing the pipeline itself.

Copy-paste prompts

Prompt 1
Walk me through wiring my own Speechmatics and Gemini keys into the boss .env and starting both the FastAPI server and the Vite frontend
Prompt 2
Show me how to swap the Featherless ensemble in boss for a single local Ollama model and keep the buy/sell/hold output shape
Prompt 3
Explain how the /execute-pipeline endpoint in boss combines the transcript and chart regime into a Kraken order
Prompt 4
Help me add basic position sizing and a max-loss guard before the Kraken CLI step in boss
Prompt 5
Give me a curl example that hits /transcribe and /analyze-chart on boss and prints the consensus
Open on GitHub → Explain another repo

Generated 2026-05-22 · Model: sonnet-4-6 · Verify against the repo before relying on details.