An explainable and efficient deep learning framework for video anomaly detection

Chongke Wu, Sicong Shao, Cihan Tunc, Pratik Satam, Salim Hariri

Research output: Contribution to journal › Article › peer-review

25 Scopus citations


Deep learning-based video anomaly detection methods have drawn significant attention in the past few years due to their superior performance. However, almost all the leading methods rely on large-scale training datasets and long training times. As a result, they remain unsuitable for many real-world video analysis tasks that require fast deployment. In addition, the leading methods offer no interpretability: their feature representations are opaque and hide the decision-making process, so the anomaly detection model behaves as a black box. Interpretability is nevertheless crucial for anomaly detection, since the appropriate response to an anomaly in a video depends on its severity and nature. To tackle these problems, this paper proposes an efficient deep learning framework for video anomaly detection that also provides explanations. The proposed framework uses pre-trained deep models to extract high-level concept and context features for training a denoising autoencoder (DAE), requiring little training time (i.e., within 10 s on the UCSD Pedestrian datasets) while achieving detection performance comparable to the leading methods. Furthermore, this framework presents the first use in video anomaly detection of combining an autoencoder with SHapley Additive exPlanations (SHAP) for model interpretability. The framework can explain each anomaly detection result in surveillance videos. In the experiments, we evaluate the proposed framework's effectiveness and efficiency and explain the anomalies behind the autoencoder’s predictions. On the UCSD Pedestrian datasets, the DAE achieved 85.9% AUC with a training time of 5 s on UCSD Ped1 and 92.4% AUC with a training time of 2.9 s on UCSD Ped2.
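
For readers unfamiliar with the AUC figures quoted in the abstract, the following is a minimal sketch of how an area under the ROC curve can be computed from per-frame anomaly scores. It is not the authors' evaluation code. It assumes that each frame receives one score (for example, an autoencoder reconstruction error), that a higher score means "more anomalous", and that ground-truth labels are 1 for anomalous frames and 0 for normal ones. The function name `roc_auc` and the sample values are hypothetical.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    scores: one anomaly score per frame (assumed: higher = more anomalous).
    labels: 1 for an anomalous frame, 0 for a normal frame.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one anomalous and one normal frame")
    # AUC equals the fraction of (anomalous, normal) pairs in which the
    # anomalous frame scores higher; tied pairs count as half.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical reconstruction errors for six frames (not data from the paper).
errors = [0.10, 0.35, 0.20, 0.80, 0.30, 0.90]
labels = [0, 0, 0, 1, 1, 1]
auc = roc_auc(errors, labels)
print(f"AUC = {auc:.3f}")
```

In this toy example, eight of the nine anomalous/normal pairs are ranked correctly, so the AUC is 8/9. An AUC of 1.0 would mean every anomalous frame scores above every normal frame, and 0.5 corresponds to chance-level ranking.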

Original language: English (US)
Pages (from-to): 2715-2737
Number of pages: 23
Journal: Cluster Computing
Issue number: 4
State: Published - Aug 2022


Keywords

  • Abnormal event detection
  • Anomaly video analysis
  • Context mining
  • Deep features
  • Interpretability
  • Security
  • Video surveillance

ASJC Scopus subject areas

  • Software
  • Computer Networks and Communications

