Click to track a moving person or object through a long video for annotation datasets or video editing.
Remove an unwanted object from a video by selecting it with a click and letting the tool fill in the background automatically.
Annotate multiple objects simultaneously in a research dataset with interactive corrections when the tracker drifts.
Build a video editing pipeline that can isolate and track moving subjects without manual frame-by-frame work.
Requires a GPU with sufficient VRAM to run Segment Anything and XMem simultaneously, a Hugging Face demo is available to try without local setup.
Track-Anything is a Python tool that lets you click on an object in a video and have it automatically follow and outline that object through every subsequent frame. You do not need to write any code or draw precise boundaries yourself, you just click on what you want to track and the system does the rest. Under the hood, it combines three separate AI models. Segment Anything (from Meta) handles the initial click-to-outline step: you click a point on an object and it generates a precise mask around that object. XMem then propagates that mask across video frames, keeping track of the object as it moves, changes size, or gets temporarily hidden. If you want to remove an object from the video rather than just track it, a third model called E2FGVI fills in the background behind it so the object appears to have never been there. The tool handles several scenarios that simpler trackers cannot: tracking multiple objects at the same time, handling camera cuts (where the scene changes abruptly), and letting you interactively correct the tracking region if the model drifts onto the wrong area mid-video. These features make it useful for video annotation work, where a human expert can guide the AI and fix mistakes in real time. Setup is a standard Python install: clone the repository, install dependencies, and run a single command to start a browser-based interface. A live demo is available on Hugging Face for trying it without any local setup. The project is a research tool from the SUSTech VIP Lab, published in 2023. It is not a commercial product and the README is primarily aimed at researchers and developers who want to use or build on these capabilities.
← gaomingqi on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.