explaingit

cjpais/handy

Analysis updated 2026-06-21

21,215RustAudience · generalComplexity · 3/5Setup · moderate

TLDR

Handy is a privacy-focused desktop app that transcribes your voice to text locally on your computer, press a hotkey, speak, and your words appear in any text field, with no audio sent to any server.

Mindmap

mindmap
  root((repo))
    What it does
      Voice to text
      Local processing
      Any text field
    Tech Stack
      Rust Tauri
      React TypeScript
      Tailwind CSS
    Models
      Whisper family
      Parakeet V3
      GPU or CPU
    Features
      Global hotkey
      CLI interface
      Cross-platform
    Audience
      Writers
      Developers
      Accessibility users
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Dictate text into any app on your computer using a hotkey, with full privacy and no cloud dependency.

USE CASE 2

Transcribe voice notes for accessibility or hands-free writing without sending audio to external servers.

USE CASE 3

Wire Handy's command-line interface into automation tools like Raycast to trigger dictation from a script.

USE CASE 4

Use GPU-accelerated Whisper models for fast, accurate local transcription in any language.

What is it built with?

RustTypeScriptReactTauriTailwind CSS

How does it compare?

cjpais/handyvaleriansaliou/sonicnikivdev/flow
Stars21,21521,20521,138
LanguageRustRustRust
Setup difficultymoderatemoderateeasy
Complexity3/53/52/5
Audiencegeneraldeveloperdeveloper

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Install via Homebrew or winget for easiest setup, building from source requires Rust and Tauri toolchain.

In plain English

Handy is a free, open source desktop application that turns spoken words into typed text, running entirely on your own computer. You press a configurable keyboard shortcut, speak, release, and the words you said appear in whatever text field you were focused on. Because the work is done locally, your voice never leaves your machine, which is the project's main pitch: a privacy-focused, offline speech-to-text tool that anyone can use, fork, or extend. Under the hood, Handy is built as a Tauri application, meaning the settings interface is a web frontend written in React and TypeScript with Tailwind CSS, while the heavy lifting (audio capture, system integration, and the actual speech recognition) is done in Rust. When you talk, Handy uses Voice Activity Detection through a model called Silero to filter silence, then transcribes the audio with one of two model families: OpenAI's Whisper models (Small, Medium, Turbo, or Large, with GPU acceleration when available) or Parakeet V3, a CPU-friendly model that detects language automatically. Audio plumbing, global keyboard shortcuts, and resampling are handled by smaller Rust libraries like cpal, rdev, and rubato. There is also a command-line interface for toggling recording, cancelling, or launching the app hidden, which makes Handy convenient to wire into automation scripts or tools like Raycast. It is cross-platform, with builds for Windows, macOS, and Linux, and is installable through Homebrew or winget as well as direct downloads. The natural users are people who want quick dictation in any app without depending on a cloud service: writers, developers, accessibility users, and anyone wary of sending audio to external servers.

Copy-paste prompts

Prompt 1
How do I install Handy on macOS using Homebrew and configure it to use the Whisper Turbo model?
Prompt 2
Show me how to set a Handy keyboard shortcut that types transcribed text directly into VS Code while I code.
Prompt 3
How do I wire Handy's CLI into a Raycast script to trigger voice dictation from a Raycast command?
Prompt 4
How do I switch between the Whisper and Parakeet V3 models in Handy, and what are the trade-offs for each?
Prompt 5
How can I build Handy from source on Linux and configure it for my audio hardware?

Frequently asked questions

What is handy?

Handy is a privacy-focused desktop app that transcribes your voice to text locally on your computer, press a hotkey, speak, and your words appear in any text field, with no audio sent to any server.

What language is handy written in?

Mainly Rust. The stack also includes Rust, TypeScript, React.

How hard is handy to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is handy for?

Mainly general.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub cjpais on gitmyhub

Verify against the repo before relying on details.