explaingit

cvhub520/x-anylabeling

9,067PythonAudience · dataComplexity · 3/5LicenseSetup · moderate

TLDR

A desktop tool for labeling images and videos with AI assistance, letting you train machine learning models by marking objects in images far faster than doing it by hand.

Mindmap

mindmap
  root((x-anylabeling))
    What it does
      Image annotation
      Video labeling
      AI auto-label
    Label types
      Bounding boxes
      Segmentation masks
      Pose points
      Text regions
    AI models
      YOLO variants
      Segment Anything
      Grounding DINO
      Gemini ChatGPT
    Export formats
      COCO
      YOLO
      VOC
    Platforms
      Windows
      Linux
      macOS
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Label thousands of product photos with bounding boxes to build a training dataset for a custom YOLO detection model.

USE CASE 2

Use the Segment Anything model to click on any object and automatically generate a pixel-perfect outline mask.

USE CASE 3

Annotate video frames for pose estimation or document text detection using built-in OCR models.

USE CASE 4

Export finished annotations in COCO, YOLO, or VOC format ready to feed into a training pipeline.

Tech stack

PythonONNX RuntimeTensorRTYOLOSegment Anything

Getting it running

Difficulty · moderate Time to first run · 30min

Requires downloading specific ONNX model files separately for each AI model you want to use.

Use and modify freely, but if you distribute modified versions they must also be open source under LGPL v3.

In plain English

X-AnyLabeling is a desktop tool for annotating images and videos with the help of AI models. Annotation means drawing labels on images to mark where objects are, what type they are, or what they look like. This kind of labeled data is what AI models need during training, and creating it by hand is slow work. X-AnyLabeling is designed to speed that up by letting AI models do an initial pass at labeling, which a human can then review and correct. The tool supports a wide range of labeling types: bounding boxes around objects, pixel-level outlines (called segmentation masks), points for pose estimation, rotated boxes for objects at odd angles, text regions for document scanning, and more. It can work with both still images and video frames. Once you have finished labeling, it can export your annotations in formats used by popular training frameworks, including COCO, YOLO, VOC, and several others. What makes it more than a basic drawing tool is the built-in model library. You can run AI models like YOLO variants, Segment Anything (a model from Meta that can outline any object you click on), Grounding DINO (which finds objects based on text descriptions you type), OCR tools for reading text in images, and vision-language models including Gemini and ChatGPT integrations. These models run locally using the ONNX Runtime or TensorRT backends, meaning you do not need to send your data to an external server unless you choose the remote inference option. The interface supports English, Chinese, Japanese, and Korean. It runs on Windows, Linux, and macOS. You can add your own custom models if the built-in library does not cover your use case. X-AnyLabeling is actively maintained and receives regular updates. Recent additions include support for SAM 3, TensorRT-accelerated YOLO inference, PaddleOCR document parsing, and 3D cuboid annotation. It is licensed under LGPL v3.

Copy-paste prompts

Prompt 1
I have 500 unlabeled product photos and want to train a YOLO model. Walk me through using X-AnyLabeling to semi-automatically label bounding boxes and export in YOLO format.
Prompt 2
How do I use Segment Anything in X-AnyLabeling to click on an object and get a pixel-perfect mask automatically?
Prompt 3
I want to label text regions in scanned documents using X-AnyLabeling's OCR tools. What workflow should I follow?
Prompt 4
How do I add my own custom ONNX model to X-AnyLabeling and use it for auto-labeling my images?
Prompt 5
I need to annotate rotated bounding boxes for objects at odd angles. How does that work in X-AnyLabeling?
Open on GitHub → Explain another repo

← cvhub520 on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.