facebookresearch/convnext

Analysis updated 2026-07-03 · repo last pushed 2023-01-08

★ 6,392PythonAudience · researcherComplexity · 3/5DormantSetup · moderate

Mindmap

mindmap
  root((convnext))
    What It Does
      Image classification
      Fine-tuning
      Feature extraction
    Model Sizes
      Tiny and small
      Base and large
      Extra large
    Tech Stack
      Python
      PyTorch
    Use Cases
      Photo sorting
      Medical imaging
      Visual inspection
    Audience
      ML researchers
      AI engineers

mindmap root((convnext)) What It Does Image classification Fine-tuning Feature extraction Model Sizes Tiny and small Base and large Extra large Tech Stack Python PyTorch Use Cases Photo sorting Medical imaging Visual inspection Audience ML researchers AI engineers

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Classify the contents of photos using a pre-trained ConvNeXt model without any training.

USE CASE 2

Fine-tune a ConvNeXt model on your own labeled image dataset to recognize custom categories.

USE CASE 3

Run automated visual inspection on a manufacturing line to detect defective parts from photos.

USE CASE 4

Build a document scanning app that automatically classifies document types using ConvNeXt.

What is it built with?

PythonPyTorch

How does it compare?

	facebookresearch/convnext	robbyant/lingbot-map	hkuds/vimax
Stars	6,392	6,393	6,399
Language	Python	Python	Python
Last pushed	2023-01-08	—	—
Maintenance	Dormant	—	—
Setup difficulty	moderate	hard	hard
Complexity	3/5	4/5	4/5
Audience	researcher	researcher	researcher

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 30min

Requires PyTorch installed, a GPU is strongly recommended for fine-tuning on custom datasets.

In plain English

ConvNeXt is a modern image recognition model that works like a traditional camera-based AI, but rebuilt from the ground up using ideas from recent breakthroughs. The core benefit is that it achieves top-tier accuracy on real-world image classification tasks while remaining simple, efficient, and practical to deploy. Instead of using trendy attention mechanisms (the approach that powers large language models), ConvNeXt sticks with straightforward convolutional layers, making it faster and more resource-friendly for common computer vision work. At a high level, the model takes an image as input and outputs what it thinks the image contains, like "this is a dog" or "this is a cat." It comes in different sizes (tiny, small, base, large, extra-large), so you can pick one that matches your hardware and speed requirements. The repository includes pre-trained versions of these models that have already been trained on millions of labeled images, so you don't need to train from scratch. You can either use them as-is to classify new images, or fine-tune them on your own image collection to teach them specialized tasks. Researchers and machine learning engineers would use this for tasks like building image search systems, automating photo sorting, detecting objects in videos, or powering visual inspection in manufacturing. A startup building a document scanning app, for example, could use ConvNeXt to automatically classify document types. A research lab studying medical imaging could fine-tune it to spot anomalies in X-rays. The repository provides everything needed: the model code, training recipes, evaluation tools, and ready-to-download pre-trained weights. The project notably chooses simplicity over complexity. While other modern models chase cutting-edge techniques, ConvNeXt proves that a well-designed classical approach, combining traditional convolutions with modern training tricks, can match or beat fancier alternatives. The README includes performance numbers showing accuracy across different model sizes and input resolutions, along with downloadable trained models so you can start using it immediately without expensive training runs.

Copy-paste prompts

Prompt 1

Using a pre-trained ConvNeXt-Tiny model, show me how to classify what is in an image using Python and PyTorch.

Prompt 2

How do I fine-tune ConvNeXt-Small on my own image dataset with custom categories using PyTorch?

Prompt 3

I want to use ConvNeXt for binary classification of medical X-ray images. Show me how to replace the final layer and train it.

Prompt 4

Show me how to download pre-trained ConvNeXt weights and run inference on a folder of images in Python.

Frequently asked questions

What is convnext?

ConvNeXt is a state-of-the-art image classification AI model from Meta Research that matches cutting-edge accuracy while staying simple and efficient, with pre-trained weights ready to download.

What language is convnext written in?

Mainly Python. The stack also includes Python, PyTorch.

Is convnext actively maintained?

Dormant — no commits in 2+ years (last push 2023-01-08).

How hard is convnext to set up?

Setup difficulty is rated moderate, with roughly 30min to a first successful run.

Who is convnext for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub facebookresearch on gitmyhub

Verify against the repo before relying on details.