TY  - EJOU
AU  - Wang, Jiaqi
AU  - Zhang, Jie
AU  - Ji, Genlin
AU  - Sheng, Bo
TI  - Criss-Cross Attention Based Auto Encoder for Video Anomaly Event Detection
T2  - Intelligent Automation & Soft Computing
PY  - 2022
VL  - 34
IS  - 3
SN  - 2326-005X
AB  - Surveillance applications generate enormous volumes of video data, and analyzing it manually incurs a huge labor cost. Reconstruction-based convolutional autoencoders have achieved great success in video anomaly detection because they can detect abnormal events automatically. Because anomaly samples are difficult to collect and annotate, these approaches learn normal patterns from normal data only, in an unsupervised way. However, convolutional autoencoders are limited in global feature extraction by the local receptive field of convolutional kernels. Moreover, 2-dimensional convolution cannot capture temporal information, although videos change over time. In this paper, we propose a method based on a Criss-Cross attention based AutoEncoder (CCAE) for capturing global visual features of sequential video frames. The method uses a Criss-Cross attention based encoder to extract global appearance features. Another Criss-Cross attention module is embedded into a bi-directional convolutional long short-term memory in the hidden layer to explore global temporal features between frames. A decoder then fuses the global appearance and temporal features and reconstructs the frames. We perform extensive experiments on two public datasets, UCSD Ped2 and CUHK Avenue. The experimental results demonstrate that CCAE achieves superior detection accuracy compared with other video anomaly detection approaches.
KW  - Video anomaly detection; bi-directional long short-term memory; convolutional autoencoder; Criss-Cross attention module
DO  - 10.32604/iasc.2022.029535