[1]XING Tianyi,GUO Maozu,CHEN Jiadong,et al.Detection of abnormal crowd behavior based on spatial-temporal adversarial variational autoencoder[J].CAAI Transactions on Intelligent Systems,2023,18(5):994-1004.[doi:10.11992/tis.202303002]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
18
Number of periods:
2023 5
Page number:
994-1004
Column:
学术论文—机器学习
Public date:
2023-09-05
- Title:
-
Detection of abnormal crowd behavior based on spatial-temporal adversarial variational autoencoder
- Author(s):
-
XING Tianyi1; GUO Maozu1; CHEN Jiadong1; ZHAO Lingling2; CHEN Linxin2; TIAN Le1
-
1. School of Electrical and Information Engineering, Beijing, University of Civil Engineering and Architecture, Beijing 100044 China;
2. Faculty of Computing, Harbin Institute of Technology, Harbin 150001, China
-
- Keywords:
-
detection on abnormal crowd behavior; variational autoencoder; autoencoder; long short-term memory network; adversarial network; spatiotemporal adversarial variational autoencoder; reconstruction errors; abnormal escape behavior
- CLC:
-
TP181
- DOI:
-
10.11992/tis.202303002
- Abstract:
-
Video-based detection of abnormal crowd behavior is important for the early discovery of safety risks and the prevention of group safety accidents. To address insufficient direct learning of the representation of abnormal samples because of the scarcity of abnormal crowd behavior events and the low detection accuracy of abnormal events, this study proposed a predictive spatiotemporal adversarial variational autoencoder (ST-AVAE) video anomaly detection model based on variational autoencoder, by adding the long short-term memory and adversarial network modules. Joint feature representation and reconstruction of the temporal and spatial dimensions of normal sample video sequences were performed, which reduced the feature loss in the reconstruction process of normal samples, thereby expanding the prediction loss of abnormal samples, avoiding dependence on abnormal samples, and realizing the detection of the abnormal behavior of crowd dispersal based on model reconstruction errors. Comparative experiments were conducted on the public dataset UMN and captured video datasets to prove that the ST-AVAE model has the optimal detection accuracy and recall rate in the detection of abnormal crowd escape behavior based on surveillance video, and the adversarial network module significantly improves the performance of anomaly detection.