Visual Motion Segmentation in Crowd Videos Based on Spatial-Angular Stacked Sparse Autoencoders

ARTICLE

Adel Hafeezallah¹, Ahlam Al-Dhamari²,³,*, Syed Abd Rahman Abu-Bakar²

1 Department of Electrical Engineering, Taibah University, Madinah, Saudi Arabia
2 Department of Electronic and Computer Engineering, Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru, 81310, Malaysia
3 Department of Computer Engineering, Hodeidah University, Hodeidah, Yemen

* Corresponding Author: Ahlam Al-Dhamari.

Computer Systems Science and Engineering 2023, 47(1), 593-611. https://doi.org/10.32604/csse.2023.039479

Received 31 January 2023; Accepted 20 March 2023; Issue published 26 May 2023

Abstract

Visual motion segmentation (VMS) is a key component of many intelligent crowd systems. It can be used to characterize flow behavior in a crowd and to detect unusual, life-threatening incidents such as crowd stampedes and crashes, which pose a serious risk to public safety and have caused numerous fatalities over the past few decades. Trajectory clustering has become one of the most popular approaches to VMS. However, complex data, with large numbers of samples and parameters, make it difficult for trajectory clustering to produce accurate motion segmentation results. This study introduces a spatial-angular stacked sparse autoencoder model (SA-SSAE) with l2-regularization and softmax, a deep learning method for visual motion segmentation that groups similar motion patterns into the same cluster. The proposed model extracts meaningful high-level features using only spatial-angular features obtained from refined tracklets (a.k.a. ‘trajectories’). We adopt l2-regularization together with sparsity regularization, which learns sparse feature representations, to guarantee the sparsity of the autoencoders. We employ a softmax layer to map the data points into accurate cluster representations. A major advantage of the SA-SSAE framework is that it can handle VMS even when individuals move around randomly. The framework clusters motion patterns effectively and with higher accuracy. We also put forward a new dataset of 21 crowd videos with manual ground truth. Experiments conducted on two crowd benchmarks demonstrate that the proposed model groups trajectories more accurately than the traditional clustering approaches used in previous studies. On the CUHK dataset, the proposed SA-SSAE framework achieved a 0.11 improvement in accuracy and a 0.13 improvement in F-measure over the best current method.
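
To make the regularization terms named in the abstract concrete, the following is a minimal sketch of a commonly used sparse autoencoder objective: reconstruction error plus an l2 weight penalty plus a KL-divergence sparsity penalty, with a softmax helper for turning scores into cluster probabilities. The abstract does not give the paper's exact formulation, so the function names, the sigmoid encoder and decoder, and the hyperparameters `lam`, `beta`, and `rho` are all illustrative assumptions rather than the authors' implementation.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def kl_bernoulli(rho, rho_hat):
    """KL divergence between Bernoulli(rho) and Bernoulli(rho_hat)."""
    return (rho * math.log(rho / rho_hat)
            + (1.0 - rho) * math.log((1.0 - rho) / (1.0 - rho_hat)))

def layer(x, W, b):
    """One dense sigmoid layer applied to a single sample: sigmoid(W x + b)."""
    return [sigmoid(sum(w * xi for w, xi in zip(row, x)) + bj)
            for row, bj in zip(W, b)]

def sparse_ae_cost(X, W1, b1, W2, b2, lam=1e-3, beta=1.0, rho=0.05):
    """Assumed objective: reconstruction + lam * l2 penalty + beta * sparsity."""
    n = len(X)
    hidden = [layer(x, W1, b1) for x in X]   # encoder activations
    recon = [layer(h, W2, b2) for h in hidden]  # decoder output (assumed sigmoid)
    # Mean of half the squared reconstruction error over all samples.
    mse = sum(sum((xi - ri) ** 2 for xi, ri in zip(x, r))
              for x, r in zip(X, recon)) / (2.0 * n)
    # l2 penalty on encoder and decoder weights (biases excluded).
    l2 = 0.5 * sum(w * w for W in (W1, W2) for row in W for w in row)
    # Average activation of each hidden unit, pushed toward the target rho.
    rho_hat = [sum(h[j] for h in hidden) / n for j in range(len(b1))]
    sparsity = sum(kl_bernoulli(rho, r) for r in rho_hat)
    return mse + lam * l2 + beta * sparsity

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]
```

In a stacked arrangement, each autoencoder would be trained on the hidden activations of the previous one, and the softmax output with the largest probability would give the cluster assignment for a trajectory. This describes the general technique only; the layer sizes and training procedure used in the paper are not stated in this excerpt.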

Keywords

Visual motion segmentation; crowd behavior analysis; trajectory analysis; crowd dynamics; autoencoders; motion patterns

Cite This Article

APA Style

Hafeezallah, A., Al-Dhamari, A., Abu-Bakar, S.A.R. (2023). Visual Motion Segmentation in Crowd Videos Based on Spatial-Angular Stacked Sparse Autoencoders. Computer Systems Science and Engineering, 47(1), 593–611. https://doi.org/10.32604/csse.2023.039479

Vancouver Style

Hafeezallah A, Al-Dhamari A, Abu-Bakar SAR. Visual Motion Segmentation in Crowd Videos Based on Spatial-Angular Stacked Sparse Autoencoders. Comput Syst Sci Eng. 2023;47(1):593–611. https://doi.org/10.32604/csse.2023.039479

IEEE Style

A. Hafeezallah, A. Al-Dhamari, and S. A. R. Abu-Bakar, “Visual Motion Segmentation in Crowd Videos Based on Spatial-Angular Stacked Sparse Autoencoders,” Comput. Syst. Sci. Eng., vol. 47, no. 1, pp. 593–611, 2023. https://doi.org/10.32604/csse.2023.039479

Copyright © 2023 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
