automatic1111/stable-diffusion-webui

Analysis updated 2026-06-20

★ 162,744PythonAudience · generalComplexity · 4/5Setup · hard

Mindmap

mindmap
  root((stable-diffusion-webui))
    What it does
      Text to image
      Image editing
      Local AI art
    Generation Modes
      txt2img
      img2img
      Inpainting
    Tools
      Face restoration
      Upscalers
      Checkpoint merger
    Tech Stack
      Python
      Gradio
    Setup
      NVIDIA GPU
      AMD Intel GPU
      Google Colab

mindmap root((stable-diffusion-webui)) What it does Text to image Image editing Local AI art Generation Modes txt2img img2img Inpainting Tools Face restoration Upscalers Checkpoint merger Tech Stack Python Gradio Setup NVIDIA GPU AMD Intel GPU Google Colab

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Generate images from text prompts on your own machine using Stable Diffusion without any coding.

USE CASE 2

Edit an existing image using inpainting or outpainting to add, remove, or extend parts of it.

USE CASE 3

Upscale a photo or restore faces using built-in tools like RealESRGAN, GFPGAN, or CodeFormer.

USE CASE 4

Run parameter experiments comparing different seeds, prompts, or sampler settings using the X/Y/Z plot feature.

What is it built with?

PythonGradio

How does it compare?

	automatic1111/stable-diffusion-webui	yt-dlp/yt-dlp	huggingface/transformers
Stars	162,744	160,821	160,308
Language	Python	Python	Python
Setup difficulty	hard	easy	moderate
Complexity	4/5	2/5	3/5
Audience	general	developer	developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard Time to first run · 1h+

Requires a dedicated GPU (NVIDIA, AMD, Intel, or Ascend), minimum 4 GB VRAM, separate installation guides per hardware type.

License not stated in the explanation.

In plain English

Stable Diffusion web UI is a web interface for Stable Diffusion, the open-source AI model that turns text prompts into images. The README opens by saying the project is implemented using the Gradio library, which is a Python toolkit for putting a browser-based form in front of a machine-learning model. Instead of editing scripts and calling the model from a terminal, you fill in a prompt, pick options, and hit Generate. The features the README lists give a sense of what it covers. There are the original txt2img and img2img modes, plus outpainting, inpainting, color sketch, prompt matrix, an Extras tab for face restoration and upscaling using tools like GFPGAN, CodeFormer, RealESRGAN, ESRGAN, SwinIR and LDSR, a Checkpoint Merger that combines up to three model checkpoints, an X/Y/Z plot for comparing parameters, textual inversion training, hypernetworks, Loras, attention syntax that lets you weight parts of a prompt, negative prompts, styles, prompt editing mid-generation, batch processing, a built-in API, and many community extensions through custom scripts. It supports Stable Diffusion 2.0 and Alt-Diffusion, loads checkpoints in safetensors format, and works on as little as 4GB of video memory according to the README. You would actually use it if you want to generate or edit images locally on your own machine and you want a friendlier interface than the command line. The installation section lists guides for NVidia, AMD, Intel and Ascend NPU hardware, plus online services like Google Colab if you do not have a suitable GPU. The tech stack is Python with Gradio, and the project is centered on running Stable Diffusion locally.

Copy-paste prompts

Prompt 1

I have Stable Diffusion web UI running locally. Write a prompt that generates a photorealistic portrait of a person in a forest at golden hour, with recommended settings.

Prompt 2

How do I use the img2img mode in Stable Diffusion web UI to turn a rough pencil sketch into a finished digital illustration?

Prompt 3

Walk me through using the inpainting feature in Stable Diffusion web UI to replace the background of a product photo without touching the subject.

Prompt 4

Show me how to load and use a Lora in Stable Diffusion web UI to apply a specific art style like Studio Ghibli to my image generations.

Prompt 5

I want to upscale a low-resolution image using Stable Diffusion web UI. Which upscaler should I use and what settings give the best results?

Frequently asked questions

What is stable-diffusion-webui?

Stable Diffusion web UI is a browser-based interface for running the Stable Diffusion AI image generator locally, type a text prompt and hit Generate to create or edit images on your own machine, no command line needed.

What language is stable-diffusion-webui written in?

Mainly Python. The stack also includes Python, Gradio.

What license does stable-diffusion-webui use?

License not stated in the explanation.

How hard is stable-diffusion-webui to set up?

Setup difficulty is rated hard, with roughly 1h+ to a first successful run.

Who is stable-diffusion-webui for?

Mainly general.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub automatic1111 on gitmyhub

Verify against the repo before relying on details.