Cargando…

Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images

In Computer vision object detection and classification are active fields of research. Applications of object detection and classification include a diverse range of fields such as surveillance, autonomous cars and robotic vision. Many intelligent systems are built by researchers to achieve the accur...

Descripción completa

Detalles Bibliográficos
Autores principales: Khalid, Bushra, Akram, Muhammad Usman, Khan, Asad Mansoor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340928/
http://dx.doi.org/10.1007/978-3-030-51935-3_15
_version_ 1783555124597096448
author Khalid, Bushra
Akram, Muhammad Usman
Khan, Asad Mansoor
author_facet Khalid, Bushra
Akram, Muhammad Usman
Khan, Asad Mansoor
author_sort Khalid, Bushra
collection PubMed
description In Computer vision object detection and classification are active fields of research. Applications of object detection and classification include a diverse range of fields such as surveillance, autonomous cars and robotic vision. Many intelligent systems are built by researchers to achieve the accuracy of human perception but could not quite achieve it yet. Convolutional Neural Networks (CNN) and Deep Learning architectures are used to achieve human like perception for object detection and scene identification. We are proposing a novel method by combining previously used techniques. We are proposing a model which takes multi-spectral images, fuses them together, drops the useless images and then provides semantic segmentation for each object (person) present in the image. In our proposed methodology we are using CNN for fusion of Visible and thermal images and Deep Learning architectures for classification and localization. Fusion of visible and thermal images is carried out to combine informative features of both images into one image. For fusion we are using Encoder-decoder architecture. Fused image is then fed into Resnet-152 architecture for classification of images. Images obtained from Resnet-152 are then fed into Mask-RCNN for localization of persons. Mask-RCNN uses Resnet-101 architecture for localization of objects. From the results it can be clearly seen that Fused model for object localization outperforms the Visible model and gives promising results for person detection for surveillance purposes. Our proposed model gives the Miss Rate of 5.25% which is much better than the previous state of the art method applied on KAIST dataset.
format Online
Article
Text
id pubmed-7340928
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73409282020-07-08 Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images Khalid, Bushra Akram, Muhammad Usman Khan, Asad Mansoor Image and Signal Processing Article In Computer vision object detection and classification are active fields of research. Applications of object detection and classification include a diverse range of fields such as surveillance, autonomous cars and robotic vision. Many intelligent systems are built by researchers to achieve the accuracy of human perception but could not quite achieve it yet. Convolutional Neural Networks (CNN) and Deep Learning architectures are used to achieve human like perception for object detection and scene identification. We are proposing a novel method by combining previously used techniques. We are proposing a model which takes multi-spectral images, fuses them together, drops the useless images and then provides semantic segmentation for each object (person) present in the image. In our proposed methodology we are using CNN for fusion of Visible and thermal images and Deep Learning architectures for classification and localization. Fusion of visible and thermal images is carried out to combine informative features of both images into one image. For fusion we are using Encoder-decoder architecture. Fused image is then fed into Resnet-152 architecture for classification of images. Images obtained from Resnet-152 are then fed into Mask-RCNN for localization of persons. Mask-RCNN uses Resnet-101 architecture for localization of objects. From the results it can be clearly seen that Fused model for object localization outperforms the Visible model and gives promising results for person detection for surveillance purposes. Our proposed model gives the Miss Rate of 5.25% which is much better than the previous state of the art method applied on KAIST dataset. 2020-06-05 /pmc/articles/PMC7340928/ http://dx.doi.org/10.1007/978-3-030-51935-3_15 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Khalid, Bushra
Akram, Muhammad Usman
Khan, Asad Mansoor
Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images
title Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images
title_full Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images
title_fullStr Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images
title_full_unstemmed Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images
title_short Multistage Deep Neural Network Framework for People Detection and Localization Using Fusion of Visible and Thermal Images
title_sort multistage deep neural network framework for people detection and localization using fusion of visible and thermal images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7340928/
http://dx.doi.org/10.1007/978-3-030-51935-3_15
work_keys_str_mv AT khalidbushra multistagedeepneuralnetworkframeworkforpeopledetectionandlocalizationusingfusionofvisibleandthermalimages
AT akrammuhammadusman multistagedeepneuralnetworkframeworkforpeopledetectionandlocalizationusingfusionofvisibleandthermalimages
AT khanasadmansoor multistagedeepneuralnetworkframeworkforpeopledetectionandlocalizationusingfusionofvisibleandthermalimages