explaingit

vocodedev/vocode-core

Analysis updated 2026-07-03

3,741 stars | Python | Audience · developer | Complexity · 3/5 | Setup · moderate

TLDR

Vocode is a Python library for building AI voice agents that hold real-time spoken conversations, with swappable providers for speech-to-text, language models, and text-to-speech.

Mindmap

mindmap
  root((repo))
    Core Pipeline
      Speech to text
      Language model
      Text to speech
    Transcription Options
      Deepgram
      AssemblyAI
      Whisper
      Google and Azure
    Voice Synthesis
      ElevenLabs
      Play.ht
      AWS Polly
    Use Cases
      Microphone conversations
      Inbound phone calls
      Outbound calling
      Zoom meetings

What do people build with it?

USE CASE 1

Build an AI phone bot that picks up inbound calls and holds a natural spoken conversation with callers

USE CASE 2

Create a voice assistant that listens through your microphone and replies using a language model of your choice

USE CASE 3

Trigger outbound phone calls from code and have an AI agent conduct the conversation automatically

What is it built with?

Python, OpenAI, Anthropic, Deepgram, ElevenLabs, Azure, Google Cloud, AssemblyAI

How does it compare?

Repo | vocodedev/vocode-core | flasgger/flasgger | mrforexample/comfyui-3d-pack
Stars | 3,741 | 3,742 | 3,742
Language | Python | Python | Python
Setup difficulty | moderate | easy | moderate
Complexity | 3/5 | 2/5 | 3/5
Audience | developer | developer | designer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate | Time to first run · 30 min

Requires API keys for at least one transcription provider, one language model, and one voice synthesis provider.
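
Since a first run needs one key per pipeline stage, a quick pre-flight check can save a failed start. The sketch below is only an illustration: the environment variable names are assumptions chosen for one possible provider combination (Deepgram, OpenAI, ElevenLabs), and `missing_keys` is a hypothetical helper, not part of Vocode. Check the documentation of the providers you choose for the exact names they expect.

```python
import os

# Assumed variable names for illustration only; each provider's
# documentation is the authority on what it actually reads.
REQUIRED_KEYS = {
    "transcription": "DEEPGRAM_API_KEY",
    "language model": "OPENAI_API_KEY",
    "voice synthesis": "ELEVEN_LABS_API_KEY",
}


def missing_keys(env=None):
    """Return a description of every pipeline stage whose key is unset or empty."""
    env = os.environ if env is None else env
    return [
        f"{stage} ({name})"
        for stage, name in REQUIRED_KEYS.items()
        if not env.get(name)
    ]


if __name__ == "__main__":
    problems = missing_keys()
    if problems:
        print("Missing API keys for: " + ", ".join(problems))
    else:
        print("All three pipeline stages have a key configured.")
```

Running the script before starting a conversation reports which stage still lacks a key.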

License information is not mentioned in the description.

In plain English

Vocode is an open source Python library for building voice-based AI agents that can hold real-time spoken conversations. You give it a microphone and speaker, connect it to a language model and a speech service, and it handles the real-time loop of listening, understanding, responding, and speaking, all in a streaming fashion so the conversation feels natural rather than clunky.

The library is designed around three interchangeable pieces: a transcription service that converts the incoming audio to text, a language model that generates the reply, and a text-to-speech service that speaks the response. Each piece has multiple provider options. For transcription you can choose from Deepgram, AssemblyAI, Whisper, Google Cloud, Azure, and others. For language models the built-in options include OpenAI and Anthropic. For voice synthesis there are providers like ElevenLabs, Play.ht, Azure, Google Cloud, AWS Polly, and more. Swapping one provider for another is a configuration change, not a code rewrite.

Beyond simple microphone conversations, Vocode can connect to phone calls. You can set up an inbound phone number that an AI agent picks up and talks to callers, or trigger outbound calls from code. It also supports dialing into Zoom meetings.

The README includes a working Python code example showing how to wire together a streaming conversation using Deepgram for transcription, ChatGPT as the agent, and Azure for synthesis, using just a handful of imports. Installation is a single pip command.

Full documentation lives at docs.vocode.dev, and there is a Discord community for contributors and users. The project is actively looking for community maintainers and describes itself as very open to contributions.
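
To make the three-piece design concrete, here is a minimal sketch of the pattern in plain Python. It does not use Vocode's API: every class and function name below (`Transcriber`, `Agent`, `Synthesizer`, `Pipeline`, `build_pipeline`, and the toy providers) is hypothetical, and the toy providers only manipulate strings instead of calling real speech or language services. It is also turn-based for brevity, whereas the library itself streams audio.

```python
from dataclasses import dataclass
from typing import Protocol


class Transcriber(Protocol):
    def transcribe(self, audio: bytes) -> str: ...


class Agent(Protocol):
    def respond(self, text: str) -> str: ...


class Synthesizer(Protocol):
    def synthesize(self, text: str) -> bytes: ...


# Hypothetical stand-ins; a real provider would call a remote service here.
class EchoTranscriber:
    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class UppercaseAgent:
    def respond(self, text: str) -> str:
        return text.upper()


class ReverseAgent:
    def respond(self, text: str) -> str:
        return text[::-1]


class BytesSynthesizer:
    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


@dataclass
class Pipeline:
    transcriber: Transcriber
    agent: Agent
    synthesizer: Synthesizer

    def handle_turn(self, audio: bytes) -> bytes:
        """Run one listen -> understand -> respond -> speak cycle."""
        text = self.transcriber.transcribe(audio)
        reply = self.agent.respond(text)
        return self.synthesizer.synthesize(reply)


# Providers are looked up by name, so switching one is a config edit.
AGENTS = {"uppercase": UppercaseAgent, "reverse": ReverseAgent}


def build_pipeline(config: dict) -> Pipeline:
    agent_cls = AGENTS[config["agent"]]
    return Pipeline(EchoTranscriber(), agent_cls(), BytesSynthesizer())


if __name__ == "__main__":
    for name in ("uppercase", "reverse"):
        pipeline = build_pipeline({"agent": name})
        print(name, pipeline.handle_turn(b"hello"))
```

Changing `{"agent": "uppercase"}` to `{"agent": "reverse"}` replaces the middle stage while the transcriber, the synthesizer, and the `handle_turn` loop stay untouched, which is the sense in which a provider swap is a configuration change.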

Copy-paste prompts

Prompt 1
Using Vocode, show me how to set up a real-time voice conversation that uses Deepgram for transcription, GPT-4 as the language model, and ElevenLabs for speech synthesis.
Prompt 2
I want to build an AI agent that answers inbound phone calls using Vocode. Walk me through setting up the phone number, connecting it to a language model, and handling the call loop.
Prompt 3
How do I swap the language model provider in a Vocode voice agent from OpenAI to Anthropic without changing anything else in the pipeline?
Prompt 4
Show me how to use Vocode to make outbound phone calls from a Python script, with an AI agent handling the conversation on the other end.

Frequently asked questions

What is vocode-core?

Vocode is a Python library for building AI voice agents that hold real-time spoken conversations, with swappable providers for speech-to-text, language models, and text-to-speech.

What language is vocode-core written in?

Mainly Python. The stack also includes OpenAI and Anthropic.

What license does vocode-core use?

License information is not mentioned in the description.

How hard is vocode-core to set up?

Setup difficulty is rated moderate, with roughly 30 minutes to a first successful run.

Who is vocode-core for?

Mainly developers.

Verify against the repo before relying on details.