Build a personal screen-aware AI tutor that answers questions about whatever is visible on your Mac screen using your voice.
Extend Clicky to use a different AI model or TTS provider by modifying the Cloudflare Worker and Swift API calls.
Use Clicky as a starting point for a macOS accessibility tool that highlights UI elements and explains them aloud.
Deploy the Cloudflare Worker to securely proxy your Anthropic, AssemblyAI, and ElevenLabs API keys without embedding them in the app.
Requires macOS 14.2 or later, Xcode 15 or later, a Cloudflare account, and paid API keys for Anthropic, AssemblyAI, and ElevenLabs.
Clicky is a macOS application that puts an AI assistant next to your cursor on screen. It can see what is on your screen, listen to you through your microphone, talk back to you using a synthesized voice, and visually point at specific elements on your display. The idea is to feel like having a real tutor sitting beside you while you work. The app runs as a menu bar item rather than appearing in the dock. When you press a keyboard shortcut (Control + Option), it takes a screenshot of your screen and listens to what you say. Your speech is transcribed in real time using a service called AssemblyAI, then sent along with the screenshot to Claude (an AI model by Anthropic) which generates a response. That response is read aloud using ElevenLabs, a text-to-speech service. Claude can also embed coordinates in its response that tell Clicky exactly where to move an on-screen pointer, so it can highlight a specific button or piece of UI while explaining it. The codebase is open source and structured in two parts: a Swift macOS app that handles the UI, audio capture, and screen capture, and a small Cloudflare Worker that acts as a proxy to hold your API keys securely so they are not embedded in the app itself. Setting it up yourself requires a Mac running macOS 14.2 or later, Xcode 15 or later, a free Cloudflare account, and API keys for Anthropic, AssemblyAI, and ElevenLabs. You deploy the worker to Cloudflare, update a few URLs in the Swift code to point to your deployed worker, then build and run the app in Xcode. The README suggests using Claude Code (a coding assistant tool) to walk through setup automatically by pasting a single prompt. The creator, Farza, has noted in an update from April 2026 that active development has moved to a private version, but the existing codebase is released under an MIT license and anyone is welcome to use, modify, or build on it.
← farzaa on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.