Auto-extract highlight clips from a long podcast or lecture
Generate bilingual styled subtitles for short-form video
Batch transcribe a folder of recordings into SRT files
Export a Final Cut Pro timeline with caption clips ready to edit
Apple Silicon Mac only, plus ffmpeg via brew and Python 3.12; LLM features need an OpenAI-compatible API key.
pycut is a command line tool for Apple Silicon Macs that turns long videos or audio recordings into shorter, subtitled clips. You point it at a file, a folder, or a glob of files, and it transcribes the speech, optionally asks an AI to pick the most interesting moments, and then exports the result in whatever format you ask for, such as standard SRT subtitles, styled ASS subtitles, a Final Cut Pro timeline, a plain text transcript, a JSON transcript, or a finished video with the subtitles burned in. The transcription step runs locally using MLX backed speech recognition models. The default model is picked based on the source language, with Parakeet for English, Qwen3 for Chinese, and Whisper Large v3 Turbo for everything else. There is a separate forced alignment model that lines up the recognized words with their exact timestamps in the audio. Long files are broken into chunks for processing and models are unloaded between pipeline stages to keep memory use under control. For the highlight extraction, translation, and transcript correction features, you need to provide an API key for an OpenAI compatible service. The README shows examples of pointing the tool at Gemini or DeepSeek by setting a base URL and model name. Without a key the tool still works, you just get subtitles for the full timeline instead of automatically chosen highlights. The command line interface has a long list of options grouped into input and output, speech recognition, AI analysis, subtitle styling, and rendering. You can translate subtitles into a second language and stack them in a bilingual layout with different colors for the original, the translation, and highlighted keywords. You can switch the output between landscape and portrait orientation, set a frame rate for FCPXML exports, and pass an existing transcript JSON back in to skip the recognition step on a rerun. Installation is one command, uv tool install pointed at the GitHub URL, after a normal brew install ffmpeg. Requirements are macOS on Apple Silicon, Python 3.12 or newer, and ffmpeg on your PATH. The README is upfront that the tool refuses to run on Intel Macs or anything other than macOS. The rest of the document is mostly worked examples: extracting highlights with a rendered video, generating subtitles only, making bilingual subtitles, batch processing a folder, and using the LLM to correct speech recognition mistakes before exporting.
Generated 2026-05-22 · Model: sonnet-4-6 · Verify against the repo before relying on details.