Cargando…
A Review on Multiscale-Deep-Learning Applications
In general, most of the existing convolutional neural network (CNN)-based deep-learning models suffer from spatial-information loss and inadequate feature-representation issues. This is due to their inability to capture multiscale-context information and the exclusion of semantic information through...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9573412/ https://www.ncbi.nlm.nih.gov/pubmed/36236483 http://dx.doi.org/10.3390/s22197384 |
_version_ | 1784810863396388864 |
---|---|
author | Elizar, Elizar Zulkifley, Mohd Asyraf Muharar, Rusdha Zaman, Mohd Hairi Mohd Mustaza, Seri Mastura |
author_facet | Elizar, Elizar Zulkifley, Mohd Asyraf Muharar, Rusdha Zaman, Mohd Hairi Mohd Mustaza, Seri Mastura |
author_sort | Elizar, Elizar |
collection | PubMed |
description | In general, most of the existing convolutional neural network (CNN)-based deep-learning models suffer from spatial-information loss and inadequate feature-representation issues. This is due to their inability to capture multiscale-context information and the exclusion of semantic information throughout the pooling operations. In the early layers of a CNN, the network encodes simple semantic representations, such as edges and corners, while, in the latter part of the CNN, the network encodes more complex semantic features, such as complex geometric shapes. Theoretically, it is better for a CNN to extract features from different levels of semantic representation because tasks such as classification and segmentation work better when both simple and complex feature maps are utilized. Hence, it is also crucial to embed multiscale capability throughout the network so that the various scales of the features can be optimally captured to represent the intended task. Multiscale representation enables the network to fuse low-level and high-level features from a restricted receptive field to enhance the deep-model performance. The main novelty of this review is the comprehensive novel taxonomy of multiscale-deep-learning methods, which includes details of several architectures and their strengths that have been implemented in the existing works. Predominantly, multiscale approaches in deep-learning networks can be classed into two categories: multiscale feature learning and multiscale feature fusion. Multiscale feature learning refers to the method of deriving feature maps by examining kernels over several sizes to collect a larger range of relevant features and predict the input images’ spatial mapping. Multiscale feature fusion uses features with different resolutions to find patterns over short and long distances, without a deep network. Additionally, several examples of the techniques are also discussed according to their applications in satellite imagery, medical imaging, agriculture, and industrial and manufacturing systems. |
format | Online Article Text |
id | pubmed-9573412 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95734122022-10-17 A Review on Multiscale-Deep-Learning Applications Elizar, Elizar Zulkifley, Mohd Asyraf Muharar, Rusdha Zaman, Mohd Hairi Mohd Mustaza, Seri Mastura Sensors (Basel) Review In general, most of the existing convolutional neural network (CNN)-based deep-learning models suffer from spatial-information loss and inadequate feature-representation issues. This is due to their inability to capture multiscale-context information and the exclusion of semantic information throughout the pooling operations. In the early layers of a CNN, the network encodes simple semantic representations, such as edges and corners, while, in the latter part of the CNN, the network encodes more complex semantic features, such as complex geometric shapes. Theoretically, it is better for a CNN to extract features from different levels of semantic representation because tasks such as classification and segmentation work better when both simple and complex feature maps are utilized. Hence, it is also crucial to embed multiscale capability throughout the network so that the various scales of the features can be optimally captured to represent the intended task. Multiscale representation enables the network to fuse low-level and high-level features from a restricted receptive field to enhance the deep-model performance. The main novelty of this review is the comprehensive novel taxonomy of multiscale-deep-learning methods, which includes details of several architectures and their strengths that have been implemented in the existing works. Predominantly, multiscale approaches in deep-learning networks can be classed into two categories: multiscale feature learning and multiscale feature fusion. Multiscale feature learning refers to the method of deriving feature maps by examining kernels over several sizes to collect a larger range of relevant features and predict the input images’ spatial mapping. Multiscale feature fusion uses features with different resolutions to find patterns over short and long distances, without a deep network. Additionally, several examples of the techniques are also discussed according to their applications in satellite imagery, medical imaging, agriculture, and industrial and manufacturing systems. MDPI 2022-09-28 /pmc/articles/PMC9573412/ /pubmed/36236483 http://dx.doi.org/10.3390/s22197384 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Review Elizar, Elizar Zulkifley, Mohd Asyraf Muharar, Rusdha Zaman, Mohd Hairi Mohd Mustaza, Seri Mastura A Review on Multiscale-Deep-Learning Applications |
title | A Review on Multiscale-Deep-Learning Applications |
title_full | A Review on Multiscale-Deep-Learning Applications |
title_fullStr | A Review on Multiscale-Deep-Learning Applications |
title_full_unstemmed | A Review on Multiscale-Deep-Learning Applications |
title_short | A Review on Multiscale-Deep-Learning Applications |
title_sort | review on multiscale-deep-learning applications |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9573412/ https://www.ncbi.nlm.nih.gov/pubmed/36236483 http://dx.doi.org/10.3390/s22197384 |
work_keys_str_mv | AT elizarelizar areviewonmultiscaledeeplearningapplications AT zulkifleymohdasyraf areviewonmultiscaledeeplearningapplications AT muhararrusdha areviewonmultiscaledeeplearningapplications AT zamanmohdhairimohd areviewonmultiscaledeeplearningapplications AT mustazaserimastura areviewonmultiscaledeeplearningapplications AT elizarelizar reviewonmultiscaledeeplearningapplications AT zulkifleymohdasyraf reviewonmultiscaledeeplearningapplications AT muhararrusdha reviewonmultiscaledeeplearningapplications AT zamanmohdhairimohd reviewonmultiscaledeeplearningapplications AT mustazaserimastura reviewonmultiscaledeeplearningapplications |