explaingit

ultralytics/yolov5

57,392PythonAudience · developerComplexity · 3/5ActiveLicenseSetup · moderate

TLDR

Real-time object detection model that identifies and locates multiple objects in images and videos. Comes in multiple sizes for different speed/accuracy tradeoffs and supports training on custom datasets.

Mindmap

mindmap
  root((YOLOv5))
    What it does
      Detects objects in images
      Locates objects in videos
      Real-time processing
    Capabilities
      Object detection
      Image segmentation
      Image classification
    Deployment
      ONNX format
      CoreML for iOS
      TFLite for mobile
      CPU and GPU
    Use cases
      Security monitoring
      Quality control
      Autonomous vehicles
      Augmented reality
    Tech stack
      Python
      PyTorch
      NVIDIA GPU support
      Apple Silicon

Things people build with this

USE CASE 1

Build a security camera system that detects people and vehicles in real time.

USE CASE 2

Train a custom model to identify defects on a manufacturing assembly line.

USE CASE 3

Deploy object detection on a mobile app or embedded device using TFLite or CoreML.

USE CASE 4

Create an augmented reality app that recognizes and labels objects in the camera feed.

Tech stack

PythonPyTorchONNXCoreMLTFLite

Getting it running

Difficulty · moderate Time to first run · 30min

PyTorch installation and model download can take 10-15 minutes depending on internet speed and system.

Use freely for any purpose, including commercial use, as long as you keep the copyright notice and license text.

In plain English

YOLOv5 is a computer vision model for detecting and identifying objects in images and videos in real time. The name YOLO stands for You Only Look Once, describing an approach to object detection where the model scans an image a single time and produces all detection results simultaneously, rather than processing it in multiple passes. This makes it fast enough for real-time video applications. The model can identify the location and category of multiple objects in a single image, drawing bounding boxes around each detected item and labeling them. YOLOv5 was developed by Ultralytics and is built on the PyTorch deep learning framework. It comes in several size variants trading speed against accuracy, from a small model suitable for embedded hardware to a larger model for maximum precision. The repository includes tools for training the model on your own custom dataset of labeled images, running inference on images or video streams, and exporting the trained model to various deployment formats including ONNX (a portable model format), Apple CoreML for iOS apps, and TFLite for Android or embedded devices. It also supports image segmentation and image classification tasks beyond the core object detection capability. The project integrates with Ultralytics Hub for cloud training and model management. The README notes that a newer model called YOLO11 is now available with improved performance, and encourages users to consider upgrading. The tech stack is Python with PyTorch, and deployment can target CPUs, NVIDIA GPUs, Apple Silicon, and mobile chipsets. You would use YOLOv5 when building a system that needs to identify and locate objects in images or video, such as security cameras, manufacturing quality control, autonomous vehicles, or augmented reality applications.

Copy-paste prompts

Prompt 1
How do I train YOLOv5 on my own dataset of labeled images for custom object detection?
Prompt 2
Show me how to run YOLOv5 inference on a video file and save the output with bounding boxes.
Prompt 3
What's the difference between YOLOv5's small, medium, and large model variants, and which should I use for a mobile app?
Prompt 4
How do I export a trained YOLOv5 model to ONNX or TFLite format for deployment?
Prompt 5
Can I use YOLOv5 for image segmentation or classification tasks, or is it only for object detection?
Open on GitHub → Explain another repo

Generated 2026-05-18 · Model: sonnet-4-6 · Verify against the repo before relying on details.