Run object detection and instance segmentation on photos using pre-trained COCO weights without training from scratch.
Train a custom model across multiple GPUs using distributed training with mixed-precision for faster runs.
Use the webcam or Jupyter notebook demo to see bounding boxes and segmentation masks drawn in real time.
Requires a CUDA-capable GPU and the COCO dataset download (~20GB) before training.
This repository, created by Facebook Research, is a now-deprecated Python library for training and running computer vision models that can detect objects in images and draw precise outlines around them. The two main tasks it covers are object detection, which draws boxes around recognized items in a photo, and instance segmentation, which goes further by tracing the exact pixel boundary of each object rather than just a box. The README notes upfront that this project has been replaced by a newer Facebook Research library called Detectron2, which covers everything this one did. Anyone starting fresh is directed there instead. For those who encounter this older codebase, it was built on top of PyTorch and was notable at release for being faster and less memory-hungry than competing implementations. It could train across multiple GPUs at once and supported a mixed-precision mode that sped things up further on compatible hardware. Inference, the step of running the model on new images to get predictions, could also run on a regular CPU without a GPU. The library includes a webcam demo and a Jupyter notebook demo so you can see the model working in real time, outlining objects the camera sees. Training required downloading a large image dataset called COCO, which contains labeled examples for over 80 object categories. Pre-trained model weights were provided so you could run inference without training from scratch. Configuration was file-based: you picked a YAML config file specifying the model architecture and training settings, then passed it to a training script. Multi-GPU runs used PyTorch's built-in distributed training launcher. The project is open source, created for research purposes, and the code remains publicly available even though active development has moved to Detectron2.
← facebookresearch on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.