Cargando…

End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System

Surveillance video has been widely used in business, security, search, and other fields. Identifying and locating specific pedestrians in surveillance video has an important application value in criminal investigation, search and rescue, etc. However, the requirements for real-time capturing and acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Lei, Mingwei, Song, Yongchao, Zhao, Jindong, Wang, Xuan, Lyu, Jun, Xu, Jindong, Yan, Weiqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9698255/
https://www.ncbi.nlm.nih.gov/pubmed/36433291
http://dx.doi.org/10.3390/s22228693
_version_ 1784838770458099712
author Lei, Mingwei
Song, Yongchao
Zhao, Jindong
Wang, Xuan
Lyu, Jun
Xu, Jindong
Yan, Weiqing
author_facet Lei, Mingwei
Song, Yongchao
Zhao, Jindong
Wang, Xuan
Lyu, Jun
Xu, Jindong
Yan, Weiqing
author_sort Lei, Mingwei
collection PubMed
description Surveillance video has been widely used in business, security, search, and other fields. Identifying and locating specific pedestrians in surveillance video has an important application value in criminal investigation, search and rescue, etc. However, the requirements for real-time capturing and accuracy are high for these applications. It is essential to build a complete and smooth system to combine pedestrian detection, tracking and re-identification to achieve the goal of maximizing efficiency by balancing real-time capture and accuracy. This paper combined the detector and Re-ID models into a single end-to-end network by introducing a new track branch to YOLOv5 architecture for tracking. For pedestrian detection, we employed the weighted bi-directional feature pyramid network (BiFPN) to enhance the network structure based on the YOLOv5-Lite, which is able to further improve the ability of feature extraction. For tracking, based on Deepsort, this paper enhanced the tracker, which uses the Noise Scale Adaptive (NSA) Kalman filter to track, and adds adaptive noise to strengthen the anti-interference of the tracking model. In addition, the matching strategy is further updated. For pedestrian re-identification, the network structure of Fastreid was modified, which can increase the feature extraction speed of the improved algorithm by leaps and bounds. Using the proposed unified network, the parameters of the entire model can be trained in an end-to-end method with the multi-loss function, which has been demonstrated to be quite valuable in some other recent works. Experimental results demonstrate that pedestrians detection can obtain a 97% mean Average Precision (mAP) and that it can track the pedestrians well with a 98.3% MOTA and a 99.8% MOTP on the MOT16 dataset; furthermore, high pedestrian re-identification performance can be achieved on the VERI-Wild dataset with a 77.3% mAP. The overall framework proposed in this paper has remarkable performance in terms of the precise localization and real-time detection of specific pedestrians across time, regions, and cameras.
format Online
Article
Text
id pubmed-9698255
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96982552022-11-26 End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System Lei, Mingwei Song, Yongchao Zhao, Jindong Wang, Xuan Lyu, Jun Xu, Jindong Yan, Weiqing Sensors (Basel) Article Surveillance video has been widely used in business, security, search, and other fields. Identifying and locating specific pedestrians in surveillance video has an important application value in criminal investigation, search and rescue, etc. However, the requirements for real-time capturing and accuracy are high for these applications. It is essential to build a complete and smooth system to combine pedestrian detection, tracking and re-identification to achieve the goal of maximizing efficiency by balancing real-time capture and accuracy. This paper combined the detector and Re-ID models into a single end-to-end network by introducing a new track branch to YOLOv5 architecture for tracking. For pedestrian detection, we employed the weighted bi-directional feature pyramid network (BiFPN) to enhance the network structure based on the YOLOv5-Lite, which is able to further improve the ability of feature extraction. For tracking, based on Deepsort, this paper enhanced the tracker, which uses the Noise Scale Adaptive (NSA) Kalman filter to track, and adds adaptive noise to strengthen the anti-interference of the tracking model. In addition, the matching strategy is further updated. For pedestrian re-identification, the network structure of Fastreid was modified, which can increase the feature extraction speed of the improved algorithm by leaps and bounds. Using the proposed unified network, the parameters of the entire model can be trained in an end-to-end method with the multi-loss function, which has been demonstrated to be quite valuable in some other recent works. Experimental results demonstrate that pedestrians detection can obtain a 97% mean Average Precision (mAP) and that it can track the pedestrians well with a 98.3% MOTA and a 99.8% MOTP on the MOT16 dataset; furthermore, high pedestrian re-identification performance can be achieved on the VERI-Wild dataset with a 77.3% mAP. The overall framework proposed in this paper has remarkable performance in terms of the precise localization and real-time detection of specific pedestrians across time, regions, and cameras. MDPI 2022-11-10 /pmc/articles/PMC9698255/ /pubmed/36433291 http://dx.doi.org/10.3390/s22228693 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lei, Mingwei
Song, Yongchao
Zhao, Jindong
Wang, Xuan
Lyu, Jun
Xu, Jindong
Yan, Weiqing
End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System
title End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System
title_full End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System
title_fullStr End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System
title_full_unstemmed End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System
title_short End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System
title_sort end-to-end network for pedestrian detection, tracking and re-identification in real-time surveillance system
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9698255/
https://www.ncbi.nlm.nih.gov/pubmed/36433291
http://dx.doi.org/10.3390/s22228693
work_keys_str_mv AT leimingwei endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem
AT songyongchao endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem
AT zhaojindong endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem
AT wangxuan endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem
AT lyujun endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem
AT xujindong endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem
AT yanweiqing endtoendnetworkforpedestriandetectiontrackingandreidentificationinrealtimesurveillancesystem