Cargando…

Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection

In recent years, human detection in indoor scenes has been widely applied in smart buildings and smart security, but many related challenges can still be difficult to address, such as frequent occlusion, low illumination and multiple poses. This paper proposes an asymmetric adaptive fusion two-strea...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wenli, Guo, Xiang, Wang, Jiaqi, Wang, Ning, Chen, Kaizhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7866388/
https://www.ncbi.nlm.nih.gov/pubmed/33572928
http://dx.doi.org/10.3390/s21030916
_version_ 1783648064669483008
author Zhang, Wenli
Guo, Xiang
Wang, Jiaqi
Wang, Ning
Chen, Kaizhen
author_facet Zhang, Wenli
Guo, Xiang
Wang, Jiaqi
Wang, Ning
Chen, Kaizhen
author_sort Zhang, Wenli
collection PubMed
description In recent years, human detection in indoor scenes has been widely applied in smart buildings and smart security, but many related challenges can still be difficult to address, such as frequent occlusion, low illumination and multiple poses. This paper proposes an asymmetric adaptive fusion two-stream network (AAFTS-net) for RGB-D human detection. This network can fully extract person-specific depth features and RGB features while reducing the typical complexity of a two-stream network. A depth feature pyramid is constructed by combining contextual information, with the motivation of combining multiscale depth features to improve the adaptability for targets of different sizes. An adaptive channel weighting (ACW) module weights the RGB-D feature channels to achieve efficient feature selection and information complementation. This paper also introduces a novel RGB-D dataset for human detection called RGBD-human, on which we verify the performance of the proposed algorithm. The experimental results show that AAFTS-net outperforms existing state-of-the-art methods and can maintain stable performance under conditions of frequent occlusion, low illumination and multiple poses.
format Online
Article
Text
id pubmed-7866388
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-78663882021-02-07 Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection Zhang, Wenli Guo, Xiang Wang, Jiaqi Wang, Ning Chen, Kaizhen Sensors (Basel) Article In recent years, human detection in indoor scenes has been widely applied in smart buildings and smart security, but many related challenges can still be difficult to address, such as frequent occlusion, low illumination and multiple poses. This paper proposes an asymmetric adaptive fusion two-stream network (AAFTS-net) for RGB-D human detection. This network can fully extract person-specific depth features and RGB features while reducing the typical complexity of a two-stream network. A depth feature pyramid is constructed by combining contextual information, with the motivation of combining multiscale depth features to improve the adaptability for targets of different sizes. An adaptive channel weighting (ACW) module weights the RGB-D feature channels to achieve efficient feature selection and information complementation. This paper also introduces a novel RGB-D dataset for human detection called RGBD-human, on which we verify the performance of the proposed algorithm. The experimental results show that AAFTS-net outperforms existing state-of-the-art methods and can maintain stable performance under conditions of frequent occlusion, low illumination and multiple poses. MDPI 2021-01-29 /pmc/articles/PMC7866388/ /pubmed/33572928 http://dx.doi.org/10.3390/s21030916 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Wenli
Guo, Xiang
Wang, Jiaqi
Wang, Ning
Chen, Kaizhen
Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection
title Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection
title_full Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection
title_fullStr Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection
title_full_unstemmed Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection
title_short Asymmetric Adaptive Fusion in a Two-Stream Network for RGB-D Human Detection
title_sort asymmetric adaptive fusion in a two-stream network for rgb-d human detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7866388/
https://www.ncbi.nlm.nih.gov/pubmed/33572928
http://dx.doi.org/10.3390/s21030916
work_keys_str_mv AT zhangwenli asymmetricadaptivefusioninatwostreamnetworkforrgbdhumandetection
AT guoxiang asymmetricadaptivefusioninatwostreamnetworkforrgbdhumandetection
AT wangjiaqi asymmetricadaptivefusioninatwostreamnetworkforrgbdhumandetection
AT wangning asymmetricadaptivefusioninatwostreamnetworkforrgbdhumandetection
AT chenkaizhen asymmetricadaptivefusioninatwostreamnetworkforrgbdhumandetection