explaingit

cjpais/handy

📈 Trending21,215RustAudience · vibe coderComplexity · 3/5ActiveLicenseSetup · moderate

TLDR

A free, private speech-to-text app that runs on your computer. Press a hotkey, speak, and Handy transcribes and pastes the text into any app, no cloud needed.

Mindmap

mindmap
  root((Handy))
    What it does
      Speech to text
      Auto-paste text
      Hotkey triggered
    How it works
      Whisper models
      Parakeet V3
      GPU acceleration
      Voice detection
    Platforms
      Windows
      macOS
      Linux
    Features
      Push-to-talk mode
      Toggle recording
      Custom shortcuts
      CLI control
    Tech stack
      Tauri framework
      React frontend
      Rust backend

Things people build with this

USE CASE 1

Transcribe voice notes hands-free while working in any application.

USE CASE 2

Build custom voice-controlled workflows using the CLI interface.

USE CASE 3

Deploy a private transcription service on your own hardware without cloud dependencies.

USE CASE 4

Integrate voice input into Raycast or other launcher tools on macOS.

Tech stack

RustTauriReactTypeScriptWhisperParakeet V3

Getting it running

Difficulty · moderate Time to first run · 30min

Requires Rust toolchain and Tauri dependencies; Whisper/Parakeet models need to download on first run.

Free and open-source; you can use, modify, and distribute it freely.

In plain English

Handy is a free, open source desktop application that turns spoken words into typed text, running entirely on your own computer. You press a configurable keyboard shortcut, speak, release, and your words appear in whatever text field you have focused, no audio is ever sent to the cloud. It is built for people who want speech-to-text without a subscription and without surrendering their voice to a third-party service. Under the hood, Handy first uses Voice Activity Detection with Silero to filter silence out of the recording, then runs the audio through your choice of local transcription models. You can pick Whisper models in several sizes (Small, Medium, Turbo, Large) with GPU acceleration when your hardware supports it, or Parakeet V3, a CPU-optimised model with automatic language detection. The transcribed text is then pasted directly into the app you were already using. Optional integrations like a Raycast extension let you control recording, browse history, and switch models from outside the app. You would reach for Handy when you want hands-free dictation into emails, code editors, chat apps, or notes, especially when privacy matters or you work offline. The project openly states it is not trying to be the best speech-to-text tool but the most forkable one, so it also appeals to developers who want to extend or rebuild a transcription stack themselves. The application is built with Tauri v2, combining a React and TypeScript frontend styled with Tailwind CSS for the settings interface, and a Rust backend for system integration, audio handling, and machine-learning inference. The full README is longer than what was provided.

Copy-paste prompts

Prompt 1
How do I set up Handy on my Windows machine and configure it to use the Whisper Turbo model?
Prompt 2
Show me how to create a custom keyboard shortcut in Handy and test it with the CLI.
Prompt 3
What's the difference between push-to-talk and toggle recording modes in Handy, and when should I use each?
Prompt 4
How do I integrate Handy with Raycast on macOS to trigger transcription from the launcher?
Prompt 5
What GPU acceleration options does Handy support, and how do I enable them for faster transcription?
Open on GitHub → Explain another repo

Generated 2026-05-21 · Model: sonnet-4-6 · Verify against the repo before relying on details.