Spectrum - Time Series Anomaly Detection Algorithms

Spectrum is a collection of time series anomaly detection algorithms. It provides implementations of state-of-the-art deep learning and machine learning approaches for unsupervised anomaly detection in temporal data.

Quick Start

Prerequisites

Python 3.8+
conda

Installation

# Clone the repository
git clone https://github.com/DeepShield-AI/spectrum.git
cd spectrum

# Create conda environment
conda env create -f environment.yml
conda activate spectrum

Datasets

Supported Datasets

We support the following public time series anomaly detection datasets:

Dataset	Domain	Description	Dimensions	Anomaly Rate
MSL	Space	Mars Science Laboratory telemetry	55	~10.7%
SMAP	Space	Soil Moisture Active Passive satellite	25	~13.1%
SMD	IT	Server Machine Dataset	38	~4.2%
PSM	IT	Pooled Server Metrics	25	~27.9%
SWAT	Industrial	Secure Water Treatment testbed	51	~12.1%
KPI	Web	Key Performance Indicators	1	Variable

Data Download & Setup

Download the datasets.zip file from https://cloud.tsinghua.edu.cn/d/75ceadaca416485e9f09/ and move it to the spectrum directory.
Unzip the datasets:

unzip datasets.zip

Extract and organize the data as follows:

datasets/
└── kpi/
    ├── raw/
    │   ├── phase2_train.csv            # training set
    │   ├── phase2_train.csv.zip        # raw training set
    │   ├── phase2_ground_truth.hdf     # test set
    │   ├── phase2_ground_truth.hdf.zip # raw test set
    │   └── phase2_ground_truth.parquet # test set in Parquet format
    ├── train/                          # training set divided according to KPI ID
    └── test/                           # test set divided according to KPI ID
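
Before running the preprocessing, it can help to confirm that the raw files ended up where the layout above expects them. The following is a minimal sketch, not part of the repository: the helper name `missing_paths` and the choice of which entries to treat as required are assumptions made for illustration, and only the file paths come from the tree above.

```python
from pathlib import Path

# Entries from the layout above that this sketch assumes preprocessing needs.
# Which files are strictly required is an assumption, not a project rule.
EXPECTED = [
    "kpi/raw/phase2_train.csv",
    "kpi/raw/phase2_ground_truth.hdf",
]


def missing_paths(root="datasets", expected=EXPECTED):
    """Return the expected entries that do not exist under `root`."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("Dataset layout looks complete.")
```

Run it from the spectrum directory after unzipping; an empty result means the raw KPI files are in place.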

Run preprocessing scripts:

# KPI-specific preprocessing (with missing value handling)
cd exp/preprocess/

# Manual cell execution instructions:
# 1. After opening kpi.ipynb, run the cells sequentially
# 2. Or use Cell -> Run All to execute all cells at once
# 3. For step-by-step execution, use Cell -> Run Cells to run selected cells
# 4. Monitor the output and adjust parameters as needed between cells

jupyter notebook kpi.ipynb  # Interactive preprocessing

Data Processing Pipeline

Our preprocessing pipeline includes:

1. Data Loading & Validation
   - Format standardization (CSV/HDF to Parquet)
   - Schema validation and type conversion
   - Timestamp normalization
2. Missing Value Analysis & Imputation
   - Gap detection and characterization
   - Missing value statistics and visualization
3. Data Splitting & Normalization
   - Train/validation/test splits
   - Min-max or z-score normalization
   - Sliding window generation
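
Three of the steps above (gap detection, z-score normalization, and sliding window generation) can be sketched in a few lines of plain Python. This is an illustrative sketch, not the code in kpi.ipynb: the function names are hypothetical, timestamps are assumed to be sorted integer epoch seconds sampled at a fixed interval, and the standard library is used in place of whatever array library the notebook relies on.

```python
from statistics import mean, pstdev


def find_gaps(timestamps, interval):
    """Return (start, end, n_missing) for every gap in sorted timestamps."""
    gaps = []
    for prev, cur in zip(timestamps, timestamps[1:]):
        step = cur - prev
        if step > interval:
            # Number of samples that should have appeared between prev and cur.
            gaps.append((prev, cur, step // interval - 1))
    return gaps


def zscore(values, mu=None, sigma=None):
    """Z-score normalize; pass training-set mu/sigma to transform test data."""
    mu = mean(values) if mu is None else mu
    sigma = pstdev(values) if sigma is None else sigma
    sigma = sigma or 1.0  # constant series: avoid division by zero
    return [(v - mu) / sigma for v in values]


def sliding_windows(values, size, stride=1):
    """Split a series into overlapping windows of length `size`."""
    return [values[i:i + size] for i in range(0, len(values) - size + 1, stride)]


if __name__ == "__main__":
    # One-minute sampling with two missing points between 120 and 300.
    print(find_gaps([0, 60, 120, 300, 360], interval=60))  # [(120, 300, 2)]
    print(sliding_windows([1, 2, 3, 4, 5], size=3))  # [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
```

Computing `mu` and `sigma` on the training split and reusing them for the validation and test splits keeps test-set statistics from leaking into the normalization.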

Processing Scripts Location:

exp/preprocess/kpi.ipynb - Interactive KPI preprocessing

Algorithms

Implemented Algorithms

Algorithm	Type	Paper	Documentation
SRCNN	CNN-based	Spectral Residual CNN	docs/SRCNN.md

Algorithm Categories

Deep Autoencoders: USAD, DAGMM, Donut
Spectral Methods: SRCNN, SaVAE-SR
Temporal Convolution: ModernTCN
Clustering-based: DCdetector, DAGMM

Results & Evaluation

Evaluation Metrics

Precision: True positives / (True positives + False positives)
Recall: True positives / (True positives + False negatives)
F1-Score: Harmonic mean of precision and recall
AUC-ROC: Area under the ROC curve
AUC-PR: Area under the Precision-Recall curve
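
The first three metrics follow directly from the definitions above. The sketch below computes them point-wise from binary labels (1 = anomaly); the function name is hypothetical, and point-wise scoring is an assumption here, since the project may apply a different evaluation protocol.

```python
def precision_recall_f1(y_true, y_pred):
    """Point-wise precision, recall, and F1 for binary labels (1 = anomaly)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    # Guard each ratio so an empty denominator yields 0.0 instead of an error.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


y_true = [0, 0, 1, 1, 1, 0, 0, 1]
y_pred = [0, 1, 1, 1, 0, 0, 0, 1]
print(precision_recall_f1(y_true, y_pred))  # (0.75, 0.75, 0.75)
```

In this example there are 3 true positives, 1 false positive, and 1 false negative, so precision and recall are both 3/4 and their harmonic mean is also 0.75.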
