explaingit

ericchen931209/auto-dataset-builder

Analysis updated 2026-05-18

3PythonAudience · researcherComplexity · 4/5LicenseSetup · moderate

TLDR

A platform that builds labeled computer vision training datasets automatically from a plain-English description, using chained AI tools for annotation and quality scoring.

Mindmap

mindmap
  root((Auto Dataset Builder))
    Input
      Natural language query
    Data collection
      YouTube CC videos
      Google Image Search
      Frame extraction
    Annotation pipeline
      YOLOv11 proposals
      SAM2 refinement
      Vision LLM verify
    Quality scoring
      Neural DQS metric
      Six feature inputs
      Predicts model mAP
    Output
      YOLO format dataset
      COCO format export
    Setup
      Docker Compose
      No GPU required
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

What do people build with it?

USE CASE 1

Generate a labeled YOLO-format dataset for a custom object category without manually annotating any images.

USE CASE 2

Score the quality of an existing image dataset and predict the detection accuracy it will produce before training.

USE CASE 3

Run an active learning loop to automatically improve a dataset that scored below your quality threshold.

What is it built with?

PythonFastAPIVue 3DockerPyTorchYOLOv11SAM2Celery

How does it compare?

ericchen931209/auto-dataset-builder0marildo/imagoagentlexi/agent-lexi
Stars333
LanguagePythonPythonPython
Setup difficultymoderateeasymoderate
Complexity4/52/54/5
Audienceresearchergeneralvibe coder

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Requires Docker Desktop, first run pulls images and takes about 3 minutes. Google Search API key is optional.

MIT license, use freely for any purpose, including commercial, as long as you keep the copyright notice.

In plain English

Auto Dataset Builder (ADB) is a platform that creates labeled training datasets for computer vision models, starting from nothing but a plain English description. You type something like "Build a Taiwan motorcycle detection dataset" and the system handles everything from finding images to annotating them and checking their quality. The pipeline has several stages that run automatically. First it collects images and video frames from YouTube (Creative Commons licensed clips only) and optionally from Google Image Search. It then extracts useful frames, runs them through three annotation steps in sequence, cleans out blurry or poorly lit images, and exports a finished dataset in the format used by YOLO object detection models. The annotation process chains three tools together. YOLOv11, an object detection model, proposes initial bounding boxes around objects. SAM2, a segmentation model, refines those boxes into more accurate boundaries. A vision language model then verifies each annotation, acting as a final check on accuracy. The project's main research contribution is a quality metric called Neural DQS (Dataset Quality Score). It takes six measurements of a dataset, including how diverse the images are, how sharp they are, how well-balanced the object classes are, and how accurate the annotations are, then feeds those numbers into a small trained model to predict the mean average precision (mAP) the trained detector will achieve before you even start training. According to the README, this score correlated at 0.929 with actual mAP across 96 test datasets. If the quality score falls below a threshold, an active learning loop runs to collect more varied examples and re-annotate them until quality improves. The system includes a web dashboard built with Vue 3 and a FastAPI backend. Setup is Docker-based: clone the repo, run docker compose up, and a browser dashboard opens at localhost:3000. No GPU is required. Optionally add a Google Search API key for image collection. The project is MIT licensed and is accompanied by a published research paper on Zenodo.

Copy-paste prompts

Prompt 1
I'm using auto-dataset-builder to create a motorcycle detection dataset. How do I submit a job via the API and check its status?
Prompt 2
Explain what the Neural DQS score measures and which of its six features matters most for improving model accuracy.
Prompt 3
I want to add a new image source to auto-dataset-builder beyond YouTube and Google Search. Where in the workers/collector code do I add it?
Prompt 4
Walk me through the three-stage annotation pipeline in auto-dataset-builder: how does YOLOv11 hand off to SAM2, and when does the vision LLM step run?

Frequently asked questions

What is auto-dataset-builder?

A platform that builds labeled computer vision training datasets automatically from a plain-English description, using chained AI tools for annotation and quality scoring.

What language is auto-dataset-builder written in?

Mainly Python. The stack also includes Python, FastAPI, Vue 3.

What license does auto-dataset-builder use?

MIT license, use freely for any purpose, including commercial, as long as you keep the copyright notice.

How hard is auto-dataset-builder to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is auto-dataset-builder for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub ericchen931209 on gitmyhub

Verify against the repo before relying on details.