Analysis updated 2026-06-21
Generate background music for a video or app prototype by typing a description of the mood and instruments
Create environmental sound effects like rain, crowd noise, or footsteps from a text description
Fine-tune MusicGen on your own music dataset to build a custom genre-specific generator
Add an invisible watermark to AI-generated audio using AudioSeal to mark it as machine-produced
| facebookresearch/audiocraft | spmallick/learnopencv | datawhalechina/llm-cookbook | |
|---|---|---|---|
| Stars | 23,252 | 22,901 | 23,959 |
| Language | Jupyter Notebook | Jupyter Notebook | Jupyter Notebook |
| Setup difficulty | hard | moderate | moderate |
| Complexity | 4/5 | 2/5 | 2/5 |
| Audience | researcher | developer | developer |
Figures from each repo's GitHub metadata at analysis time.
Requires Python 3.9, PyTorch, and a GPU for reasonable inference speed, model weights are non-commercial use only.
AudioCraft is a research library from Meta (Facebook Research) that lets you generate audio and music using AI. Give it a text description like "upbeat jazz with piano and drums" and it produces a matching audio clip, no musical knowledge or instruments needed. The library bundles several AI models. MusicGen generates music from text descriptions and can also follow a melody you hum or upload. AudioGen does the same for environmental sounds, things like rain, crowd noise, or footsteps. EnCodec is a neural audio compressor that converts audio into a compact form and back, which the other models use internally. There is also AudioSeal for adding invisible watermarks to AI-generated audio, and JASCO for music generation guided by specific chords, melodies, or drum patterns. Under the hood everything is built on PyTorch, a popular framework for deep learning research. The models are pre-trained, so you can run them without training anything yourself, just install the library and call the model with your text prompt. Training code is also included for researchers who want to fine-tune or build on top of these models. You would use AudioCraft when prototyping apps that need background music generation, when doing audio research, or when experimenting with AI-generated sound design. It requires Python 3.9 and PyTorch. Model weights are available for non-commercial use under a separate license.
AudioCraft is a Meta research library that generates music and environmental sounds from text descriptions using AI, describe what you want to hear, and it produces a matching audio clip without any musical knowledge or instruments.
Mainly Jupyter Notebook. The stack also includes Python, PyTorch, Jupyter Notebook.
Model weights are available for non-commercial research use only, check each model's individual license before any commercial application.
Setup difficulty is rated hard, with roughly 1h+ to a first successful run.
Mainly researcher.
This repo across BitVibe Labs
Verify against the repo before relying on details.