VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection (ACCV'24)

This repository is the official open-source implementation of VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection by Sunghyun Ahn, Youngwan Jo, Kijung Lee, and Sanghyun Park.

📣 News

[2024/10/09] Instructions for data preparation are released!
[2024/09/25] Our code and memory banks are released!
[2024/09/20] Our VPC paper has been accepted to ACCV 2024!

Description

Video anomaly detection (VAD) is currently gaining attention through memory techniques that store the features of normal frames. The stored features are used for frame reconstruction or prediction, and an abnormality is identified when the generated frame differs significantly from the ground-truth (GT) frame. However, this approach faces several challenges because the memory and the encoder-decoder model must be optimized simultaneously: increased optimization difficulty, implementation complexity, and performance that varies with the memory size. To address these challenges, we propose an effective memory method for VAD, called VideoPatchCore (VPC). Inspired by PatchCore, our approach introduces a structure that prioritizes memory optimization and configures three types of memory (spatial, temporal, and high-level semantic) tailored to the characteristics of video data. This method effectively addresses the limitations of existing memory-based methods, achieving performance comparable to state-of-the-art methods.
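
As a rough illustration of the PatchCore-style idea mentioned above, the sketch below builds a small memory bank from "normal" features with greedy k-center (farthest-point) coreset selection and scores a test feature by its distance to the nearest stored feature. It is a generic, simplified sketch and not the code in this repository: the function names `greedy_coreset` and `anomaly_score`, the toy 2-D features, and the use of plain Euclidean distance are all assumptions made for illustration.

```python
import math
import random


def greedy_coreset(features, fraction, seed=0):
    """Pick a subset of `features` by greedy k-center (farthest-point) selection.

    `fraction` plays the role of a coreset ratio (e.g. 0.01 keeps about 1%).
    """
    if not features:
        return []
    n_keep = max(1, int(len(features) * fraction))
    rng = random.Random(seed)
    first = rng.randrange(len(features))
    selected = [first]
    # Distance from every feature to its closest selected feature so far.
    min_dist = [math.dist(f, features[first]) for f in features]
    while len(selected) < n_keep:
        # Take the feature that is farthest from the current coreset.
        idx = max(range(len(features)), key=min_dist.__getitem__)
        selected.append(idx)
        for i, f in enumerate(features):
            min_dist[i] = min(min_dist[i], math.dist(f, features[idx]))
    return [features[i] for i in selected]


def anomaly_score(feature, memory_bank):
    """Distance from a test feature to its nearest neighbour in the memory bank."""
    return min(math.dist(feature, m) for m in memory_bank)


# Toy "normal" features clustered near the origin (hypothetical data).
rng = random.Random(42)
normal = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
memory = greedy_coreset(normal, fraction=0.1)

score_normal = anomaly_score((0.1, -0.2), memory)
score_abnormal = anomaly_score((8.0, 8.0), memory)
print(len(memory), score_normal < score_abnormal)
```

Because the memory bank is built once from extracted features, no encoder-decoder has to be trained jointly with it; the coreset fraction only trades memory size against coverage of the normal features.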

Dependencies

python >= 3.8
torch == 1.13.1+cu117
torchvision == 0.14.1+cu117
scikit-learn == 1.0.2
opencv-python
h5py
fastprogress
Other common packages.

Notes

The Vision Encoder is based on openai-clip and the Object Detector is based on YOLOv5. Please follow the links to download the packages. Thanks to the authors for their great work.

Datasets

Please follow the instructions to prepare the training and testing datasets.
You can specify the dataset's path by editing 'data_root' in extra/config.py.

CUHK Avenue	Shanghai Tech.	IITB Corridor
Official Site	Official Site	Official Site

Object Detection

Set 'work_dir' in ObjectDetection/extra/config.py to the path of the working directory where the object files will be stored.
Navigate to the ObjectDetection directory and enter one of the following commands.
For dataset_name, choose one of: avenue, shanghai, iitb.
We set --consecutive to 10 for avenue and to 4 for shanghai and iitb.
Object files are saved in the objects directory of the working directory.

# default option for object detection
python run.py --work_num=0 --dataset={dataset_name}
# change number of input frames
python run.py --work_num=0 --dataset={dataset_name} --consecutive=10
# save bounding box file
python run.py --work_num=0 --dataset={dataset_name} --is_save_train_pickle=True --is_save_test_pickle=True
# load bounding box and save object batches
python run.py --work_num=0 --dataset={dataset_name} --is_load_train_pickle=True --is_load_test_pickle=True
# save images with bounding boxes
python run.py --work_num=0 --dataset={dataset_name} --save_image=True
# save all detected object images
python run.py --work_num=0 --dataset={dataset_name} --save_image_all=True
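
When running object detection for several datasets, it can help to build the command lines programmatically. The sketch below assembles the argument list for run.py using the --consecutive values stated above. The helper name `object_detection_command` and the `CONSECUTIVE` table are hypothetical and not part of this repository; only the flags shown in the commands above are used.

```python
import shlex

# Values taken from the text above: --consecutive is 10 for avenue, 4 otherwise.
CONSECUTIVE = {"avenue": 10, "shanghai": 4, "iitb": 4}


def object_detection_command(dataset, work_num=0, **flags):
    """Build the argument list for run.py (hypothetical helper, not in the repo)."""
    if dataset not in CONSECUTIVE:
        raise ValueError(f"unknown dataset: {dataset!r}")
    cmd = [
        "python", "run.py",
        f"--work_num={work_num}",
        f"--dataset={dataset}",
        f"--consecutive={CONSECUTIVE[dataset]}",
    ]
    # Extra flags such as is_save_train_pickle=True become --is_save_train_pickle=True.
    cmd += [f"--{name}={value}" for name, value in flags.items()]
    return cmd


for name in CONSECUTIVE:
    print(shlex.join(object_detection_command(name)))
```

To execute a command rather than print it, the list can be passed to `subprocess.run(cmd, cwd="ObjectDetection", check=True)`, assuming the script is launched from the repository root.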

Memorization and Inference

Set 'work_dir' in Memorization/extra/config.py to the path used for Object Detection.
Navigate to the Memorization directory and enter one of the following commands to perform memorization and inference.
The lf files and the spatial and temporal memory banks are saved in the l_features directory of the working directory.
The gf files and the high-level semantic memory bank are saved in the g_features directory of the working directory.

# recommended option for avenue dataset 
python run.py \
    --work_num=0 --consecutive=10 --dataset=avenue --cnl_pool=32 \
    --spatial_f_coreset=0.01 --temporal_f_coreset=0.01 --highlevel_f_coreset=0.01 

# recommended option for shanghai dataset 
python run.py \
    --work_num=0 --consecutive=4 --dataset=shanghai --cnl_pool=64 \
    --spatial_f_coreset=0.25 --temporal_f_coreset=0.25 --highlevel_f_coreset=0.25

# recommended option for iitb dataset 
python run.py \
    --work_num=0 --consecutive=4 --dataset=iitb --cnl_pool=64 \
    --spatial_f_coreset=0.1 --temporal_f_coreset=0.1 --highlevel_f_coreset=0.1

# save test features and perform inference using saved memories
python run.py \
    --work_num=0 --consecutive=4 --dataset=iitb --cnl_pool=64 \
    --spatial_f_coreset=0.1 --temporal_f_coreset=0.1 --highlevel_f_coreset=0.1 \
    --save_memory=False

# perform inference using the saved test features and memories
python run.py \
    --work_num=0 --consecutive=4 --dataset=iitb --cnl_pool=64 \
    --spatial_f_coreset=0.1 --temporal_f_coreset=0.1 --highlevel_f_coreset=0.1 \
    --save_feature=False --save_memory=False
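
The recommended options above differ only in a few values per dataset, so they can be kept in one table. The sketch below reproduces those values and the two reuse variants (--save_memory=False, and --save_feature=False with --save_memory=False). The helper `memorization_command` and the `RECOMMENDED` table are hypothetical names introduced for illustration; the sketch also assumes, as in the commands above, that the three coreset flags share one value per dataset.

```python
import shlex

# Recommended options copied from the commands above.
RECOMMENDED = {
    "avenue": {"consecutive": 10, "cnl_pool": 32, "coreset": 0.01},
    "shanghai": {"consecutive": 4, "cnl_pool": 64, "coreset": 0.25},
    "iitb": {"consecutive": 4, "cnl_pool": 64, "coreset": 0.1},
}


def memorization_command(dataset, work_num=0, save_feature=None, save_memory=None):
    """Build the argument list for run.py in the Memorization directory.

    Hypothetical helper: save_feature / save_memory are only appended when set,
    so the script's own defaults apply otherwise.
    """
    opts = RECOMMENDED[dataset]
    cmd = [
        "python", "run.py",
        f"--work_num={work_num}",
        f"--consecutive={opts['consecutive']}",
        f"--dataset={dataset}",
        f"--cnl_pool={opts['cnl_pool']}",
    ]
    for kind in ("spatial", "temporal", "highlevel"):
        cmd.append(f"--{kind}_f_coreset={opts['coreset']}")
    if save_feature is not None:
        cmd.append(f"--save_feature={save_feature}")
    if save_memory is not None:
        cmd.append(f"--save_memory={save_memory}")
    return cmd


# Full run, then inference reusing saved memories, then reusing features as well.
print(shlex.join(memorization_command("iitb")))
print(shlex.join(memorization_command("iitb", save_memory=False)))
print(shlex.join(memorization_command("iitb", save_feature=False, save_memory=False)))
```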

Bounding box and Memory

The following is the working directory used for the experiments. This directory includes the bounding box and memory files.
We provide the structure of the working directory. Please refer to the link below.
We provide the code used for the experiments through Google Colab.

Working Directory	Directory Structure	Experiment Code
Google Drive	Github README	Google Colab

Qualitative Evaluation

By effectively leveraging the three memory components, VPC detects anomalous events such as the following.

Demo (videos not shown here): Running, Jumping, Throwing a bag, Wrong direction

Citation

If you use our work, please consider citing:

@article{ahn2024videopatchcore,
  title={VideoPatchCore: An Effective Method to Memorize Normality for Video Anomaly Detection},
  author={Ahn, Sunghyun and Jo, Youngwan and Lee, Kijung and Park, Sanghyun},
  journal={arXiv preprint arXiv:2409.16225},
  year={2024}
}

Contact

Should you have any questions, please create an issue on this repository or contact me at [email protected].
