Strip silence from audio files before sending them to a transcription model to reduce cost and processing time
Build a voice bot that only activates its audio pipeline when someone is actually speaking, cutting CPU and API costs
Clean up call-center or podcast recording datasets by automatically tagging speech versus silence segments before training
Add a lightweight voice-activity trigger to an IoT or mobile device so it only wakes up and processes audio when speech is detected
Silero VAD is a pre-trained machine learning model that detects when speech is present in an audio recording. Given an audio file or a stream, it returns timestamps marking where people are talking and where they are not. This is useful any time you need to separate speech from silence or background noise before passing audio on to a transcription system or another processing step. The model is small, around 2 megabytes, and fast: it processes a 30-millisecond audio chunk in under 1 millisecond on a single CPU core. It was trained on recordings covering more than 6,000 languages, which means it does not need to be retrained or adjusted for a specific language. It works at 8,000 Hz and 16,000 Hz sampling rates. Installation is a one-line pip command. The Python API is minimal: load the model, read an audio file, call get_speech_timestamps, and get back a list of time ranges where speech was detected. The underlying model can also run through ONNX runtime, which allows it to work in environments without PyTorch. Community-maintained wrappers exist for C++, Rust, Go, Java, and C#, as well as browser-based use through ONNX Runtime Web. Common use cases listed in the README include processing call-center recordings, building voice bots that only activate when someone speaks, cleaning audio datasets before training other models, and adding voice interfaces to IoT or mobile devices. The model is published under the MIT license. The README notes it has no telemetry, no registration requirement, no expiration date, and no vendor lock-in. It is also available on PyPI as a standard Python package.
← snakers4 on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.