Multi-Target Tracking of Person Based on Deep Learning

Xujun Li; Guodong Fang; Liming Rao; Tengze Zhang

doi:10.32604/csse.2023.038154

Open Access icon Open Access

ARTICLE

Multi-Target Tracking of Person Based on Deep Learning

Xujun Li^*, Guodong Fang, Liming Rao, Tengze Zhang

Physics and Optoelectronic Engineering College, Xiangtan University, Xiangtan, 411105, China

* Corresponding Author: Xujun Li. Email: email

Computer Systems Science and Engineering 2023, 47(2), 2671-2688. https://doi.org/10.32604/csse.2023.038154

Received 29 November 2022; Accepted 17 February 2023; Issue published 28 July 2023

Abstract

To improve the tracking accuracy of persons in the surveillance video, we proposed an algorithm for multi-target tracking persons based on deep learning. In this paper, we used You Only Look Once v5 (YOLOv5) to obtain person targets of each frame in the video and used Simple Online and Realtime Tracking with a Deep Association Metric (DeepSORT) to do cascade matching and Intersection Over Union (IOU) matching of person targets between different frames. To solve the IDSwitch problem caused by the low feature extraction ability of the Re-Identification (ReID) network in the process of cascade matching, we introduced Spatial Relation-aware Global Attention (RGA-S) and Channel Relation-aware Global Attention (RGA-C) attention mechanisms into the network structure. The pre-training weights are loaded for Transfer Learning training on the dataset CUHK03. To enhance the discrimination performance of the network, we proposed a new loss function design method, which introduces the Hard-Negative-Mining way into the benchmark triplet loss. To improve the classification accuracy of the network, we introduced a Label-Smoothing regularization method to the cross-entropy loss. To facilitate the model’s convergence stability and convergence speed at the early training stage and to prevent the model from oscillating around the global optimum due to excessive learning rate at the later stage of training, this paper proposed a learning rate regulation method combining Linear-Warmup and exponential decay. The experimental results on CUHK03 show that the mean Average Precision (mAP) of the improved ReID network is 76.5%. The Top 1 is 42.5%, the Top 5 is 65.4%, and the Top 10 is 74.3% in Cumulative Matching Characteristics (CMC); Compared with the original algorithm, the tracking accuracy of the optimized DeepSORT tracking algorithm is improved by 2.5%, the tracking precision is improved by 3.8%. The number of identity switching is reduced by 25%. The algorithm effectively alleviates the IDSwitch problem, improves the tracking accuracy of persons, and has a high practical value.

Keywords

YOLOv5; DeepSORT; deep learning; attention mechanism; person re-identification; multi-target tracking

Cite This Article

APA Style

Li, X., Fang, G., Rao, L., Zhang, T. (2023). Multi-target tracking of person based on deep learning. Computer Systems Science and Engineering, 47(2), 2671-2688. https://doi.org/10.32604/csse.2023.038154

Vancouver Style

Li X, Fang G, Rao L, Zhang T. Multi-target tracking of person based on deep learning. Comput Syst Sci Eng. 2023;47(2):2671-2688 https://doi.org/10.32604/csse.2023.038154

IEEE Style

X. Li, G. Fang, L. Rao, and T. Zhang "Multi-Target Tracking of Person Based on Deep Learning," Comput. Syst. Sci. Eng., vol. 47, no. 2, pp. 2671-2688. 2023. https://doi.org/10.32604/csse.2023.038154

BibTex EndNote RIS

This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Multi-Target Tracking of Person Based on Deep Learning

Abstract

Keywords

Cite This Article

470

269

2

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link