explaingit

facebookresearch/audiocraft

Analysis updated 2026-06-21

23,252Jupyter NotebookAudience · researcherComplexity · 4/5LicenseSetup · hard

TLDR

AudioCraft is a Meta research library that generates music and environmental sounds from text descriptions using AI, describe what you want to hear, and it produces a matching audio clip without any musical knowledge or instruments.

Mindmap

mindmap
  root((repo))
    What it does
      Text to music
      Text to sound effects
      AI audio generation
    Models included
      MusicGen
      AudioGen
      EnCodec
      AudioSeal
    Tech stack
      Python
      PyTorch
      Jupyter Notebook
    Use cases
      Background music
      Sound design
      Audio research
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Generate background music for a video or app prototype by typing a description of the mood and instruments

USE CASE 2

Create environmental sound effects like rain, crowd noise, or footsteps from a text description

USE CASE 3

Fine-tune MusicGen on your own music dataset to build a custom genre-specific generator

USE CASE 4

Add an invisible watermark to AI-generated audio using AudioSeal to mark it as machine-produced

What is it built with?

PythonPyTorchJupyter Notebook

How does it compare?

facebookresearch/audiocraftspmallick/learnopencvdatawhalechina/llm-cookbook
Stars23,25222,90123,959
LanguageJupyter NotebookJupyter NotebookJupyter Notebook
Setup difficultyhardmoderatemoderate
Complexity4/52/52/5
Audienceresearcherdeveloperdeveloper

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1h+

Requires Python 3.9, PyTorch, and a GPU for reasonable inference speed, model weights are non-commercial use only.

Model weights are available for non-commercial research use only, check each model's individual license before any commercial application.

In plain English

AudioCraft is a research library from Meta (Facebook Research) that lets you generate audio and music using AI. Give it a text description like "upbeat jazz with piano and drums" and it produces a matching audio clip, no musical knowledge or instruments needed. The library bundles several AI models. MusicGen generates music from text descriptions and can also follow a melody you hum or upload. AudioGen does the same for environmental sounds, things like rain, crowd noise, or footsteps. EnCodec is a neural audio compressor that converts audio into a compact form and back, which the other models use internally. There is also AudioSeal for adding invisible watermarks to AI-generated audio, and JASCO for music generation guided by specific chords, melodies, or drum patterns. Under the hood everything is built on PyTorch, a popular framework for deep learning research. The models are pre-trained, so you can run them without training anything yourself, just install the library and call the model with your text prompt. Training code is also included for researchers who want to fine-tune or build on top of these models. You would use AudioCraft when prototyping apps that need background music generation, when doing audio research, or when experimenting with AI-generated sound design. It requires Python 3.9 and PyTorch. Model weights are available for non-commercial use under a separate license.

Copy-paste prompts

Prompt 1
Write Python code using AudioCraft's MusicGen to generate a 30-second upbeat electronic track and save it as a WAV file
Prompt 2
Show me how to use AudioCraft's AudioGen to produce rain and crowd noise sound effects from text prompts
Prompt 3
How do I use MusicGen's melody-guided mode to generate music that follows a melody I upload?
Prompt 4
What Python environment and dependencies do I need to install AudioCraft and run MusicGen on a GPU?
Prompt 5
How do I fine-tune MusicGen on my own dataset of 10-second music clips using AudioCraft's training code?

Frequently asked questions

What is audiocraft?

AudioCraft is a Meta research library that generates music and environmental sounds from text descriptions using AI, describe what you want to hear, and it produces a matching audio clip without any musical knowledge or instruments.

What language is audiocraft written in?

Mainly Jupyter Notebook. The stack also includes Python, PyTorch, Jupyter Notebook.

What license does audiocraft use?

Model weights are available for non-commercial research use only, check each model's individual license before any commercial application.

How hard is audiocraft to set up?

Setup difficulty is rated hard, with roughly 1h+ to a first successful run.

Who is audiocraft for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub facebookresearch on gitmyhub

Verify against the repo before relying on details.