Cargando…

Inception Convolution and Feature Fusion for Person Search

With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ouyang, Huan, Zeng, Jiexian, Leng, Lu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9963104/ https://www.ncbi.nlm.nih.gov/pubmed/36850579 http://dx.doi.org/10.3390/s23041984

_version_	1784896168603418624
author	Ouyang, Huan Zeng, Jiexian Leng, Lu
author_facet	Ouyang, Huan Zeng, Jiexian Leng, Lu
author_sort	Ouyang, Huan
collection	PubMed
description	With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks.
format	Online Article Text
id	pubmed-9963104
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-99631042023-02-26 Inception Convolution and Feature Fusion for Person Search Ouyang, Huan Zeng, Jiexian Leng, Lu Sensors (Basel) Article With the rapid advancement of deep learning theory and hardware device computing capacity, computer vision tasks, such as object detection and instance segmentation, have entered a revolutionary phase in recent years. As a result, extremely challenging integrated tasks, such as person search, might develop quickly. The majority of efficient network frameworks, such as Seq-Net, are based on Faster R-CNN. However, because of the parallel structure of Faster R-CNN, the performance of re-ID can be significantly impacted by the single-layer, low resolution, and occasionally overlooked check feature diagrams retrieved during pedestrian detection. To address these issues, this paper proposed a person search methodology based on an inception convolution and feature fusion module (IC-FFM) using Seq-Net (Sequential End-to-end Network) as the benchmark. First, we replaced the general convolution in ResNet-50 with the new inception convolution module (ICM), allowing the convolution operation to effectively and dynamically distribute various channels. Then, to improve the accuracy of information extraction, the feature fusion module (FFM) was created to combine multi-level information using various levels of convolution. Finally, Bounding Box regression was created using convolution and the double-head module (DHM), which considerably enhanced the accuracy of pedestrian retrieval by combining global and fine-grained information. Experiments on CHUK-SYSU and PRW datasets showed that our method has higher accuracy than Seq-Net. In addition, our method is simpler and can be easily integrated into existing two-stage frameworks. MDPI 2023-02-10 /pmc/articles/PMC9963104/ /pubmed/36850579 http://dx.doi.org/10.3390/s23041984 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ouyang, Huan Zeng, Jiexian Leng, Lu Inception Convolution and Feature Fusion for Person Search
title	Inception Convolution and Feature Fusion for Person Search
title_full	Inception Convolution and Feature Fusion for Person Search
title_fullStr	Inception Convolution and Feature Fusion for Person Search
title_full_unstemmed	Inception Convolution and Feature Fusion for Person Search
title_short	Inception Convolution and Feature Fusion for Person Search
title_sort	inception convolution and feature fusion for person search
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9963104/ https://www.ncbi.nlm.nih.gov/pubmed/36850579 http://dx.doi.org/10.3390/s23041984
work_keys_str_mv	AT ouyanghuan inceptionconvolutionandfeaturefusionforpersonsearch AT zengjiexian inceptionconvolutionandfeaturefusionforpersonsearch AT lenglu inceptionconvolutionandfeaturefusionforpersonsearch

Inception Convolution and Feature Fusion for Person Search

Ejemplares similares