Analysis updated 2026-06-24
Train the model on your own mockup and HTML pairs
Run the Bootstrap notebook to convert sample screenshots into HTML
Study an encoder decoder architecture for image to code
Adapt the pipeline for a different target language like JSX
| emilwallner/screenshot-to-code | windmill-labs/windmill | trycua/cua | |
|---|---|---|---|
| Stars | 16,482 | 16,469 | 16,526 |
| Language | HTML | HTML | HTML |
| Setup difficulty | hard | hard | hard |
| Complexity | 4/5 | 4/5 | 4/5 |
| Audience | researcher | ops devops | researcher |
Figures from each repo's GitHub metadata at analysis time.
Old Keras and TensorFlow versions plus a GPU make environment setup the main hurdle.
This project is a neural network (an AI system modeled loosely on how the brain works) that takes a screenshot or design mockup image and automatically converts it into working HTML and CSS code, the markup languages used to build web pages. Instead of a designer handing a mockup to a developer who then manually translates it into code, this AI does that translation step automatically. The system is built in three stages. The simplest version (called "Hello World") handles basic layouts, the middle version works with raw HTML, and the most capable version uses a component framework called Bootstrap. The Bootstrap version achieves 97% accuracy and is the only one that can handle new, unseen design mockups reliably. It works by learning 16 design tokens, simplified labels for UI components, that it then translates into HTML and CSS. The model is built using Keras, a deep learning library, running on top of a Jupyter Notebook environment, which is an interactive coding tool popular in data science. It uses an encoder-decoder architecture: the encoder reads the design image, and the decoder generates code token by token. The best-performing model uses a GRU (a type of recurrent neural network unit) instead of an older variant called LSTM. You would use this project if you are a developer or researcher interested in experimenting with AI-driven front-end code generation, or if you want to learn how image-to-code translation models work. It is more of a research prototype than a production tool, the dataset it was trained on is small and homogeneous, so performance on complex real-world layouts may vary.
Research neural net that turns a design screenshot into HTML and CSS. Encoder decoder model in Keras with three sample versions ending in a Bootstrap output.
Mainly HTML. The stack also includes Python, Keras, Jupyter.
Setup difficulty is rated hard, with roughly 1day+ to a first successful run.
Mainly researcher.
This repo across BitVibe Labs
Verify against the repo before relying on details.