explaingit

yzhao062/anomaly-detection-resources

9,282PythonAudience · researcherComplexity · 1/5Setup · easy

TLDR

A curated reading list of academic papers, textbooks, lectures, benchmark datasets, and open-source Python libraries covering anomaly and outlier detection across tabular, time series, and graph data.

Mindmap

mindmap
  root((repo))
    What it covers
      Papers by topic
      Open-source tools
      Benchmark datasets
    Paper topics
      Time series methods
      Graph-based detection
      LLM approaches
      High-dimensional data
    Software tools
      Tabular data libs
      Time series tools
      Real-time Elasticsearch
    Resources
      Textbooks lectures
      Research conferences
      PyOD ADBench links
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Find academic papers on a specific anomaly detection method such as time series or graph-based outlier detection.

USE CASE 2

Discover Python libraries for detecting outliers in tabular, time-series, or graph data without building from scratch.

USE CASE 3

Locate public benchmark datasets to evaluate a new anomaly detection algorithm against established baselines.

USE CASE 4

Identify the major conferences and journals where anomaly detection research is published.

Tech stack

Python

Getting it running

Difficulty · easy Time to first run · 5min

In plain English

This repository is a curated reading list for anyone who wants to learn about anomaly detection, the practice of finding data points that behave unusually compared to everything else. The field goes by two names: anomaly detection and outlier detection. They mean the same thing. Examples of where it matters include catching fraudulent credit card charges, spotting network intrusions, and flagging defective parts coming off a production line. The repository does not contain runnable code. It is a structured collection of references: textbooks, academic papers, recorded lectures and seminars, datasets you can use for testing, and links to open-source software libraries. There are also pointers to the major research conferences and journals where new work in this area gets published. The author, a researcher at USC, maintains it alongside a separate Python library called PyOD and a benchmarking suite called ADBench, both of which appear in the resource links. The paper section is extensive and organized by topic. Separate sections cover methods that use large language models for anomaly detection, techniques that work with limited labeled data, approaches based on neural networks, methods designed for high-dimensional data, time series detection, graph-based detection, and surveys that give broad overviews of the field. Each entry lists the paper title, the venue where it was published, the year, and links to the PDF and code where available. The toolbox section groups software by data type. There are libraries for tabular data outlier detection, tools specifically for time series, tools for graph-structured data, and a real-time option built around Elasticsearch. The datasets section lists public collections commonly used to evaluate detection methods. This is a reference resource, not a starter project. If you are trying to understand what anomaly detection is, where it is used, what the main techniques are, or which software exists to do it, this list is a useful map of the territory. The repository was last updated in late 2025 with new entries covering large language model and vision-language model approaches. The full README is longer than what was shown.

Copy-paste prompts

Prompt 1
Based on the anomaly-detection-resources list, which Python library should I use to detect outliers in a tabular dataset with 50 numerical features?
Prompt 2
I want to detect anomalies in a time series of server CPU metrics. Which tools from this resource list should I start with?
Prompt 3
How do I use PyOD (referenced in anomaly-detection-resources) to run an isolation forest on a CSV of transaction data in Python?
Prompt 4
Find me survey papers on anomaly detection for high-dimensional data from this curated list that have available code.
Prompt 5
Which datasets in the anomaly-detection-resources list are commonly used to benchmark fraud detection algorithms?
Open on GitHub → Explain another repo

← yzhao062 on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.