Cargando…

Efficient Network Architecture Search Using Hybrid Optimizer

Manually designing a convolutional neural network (CNN) is an important deep learning method for solving the problem of image classification. However, most of the existing CNN structure designs consume a significant amount of time and computing resources. Over the years, the demand for neural archit...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ting-Ting, Chu, Shu-Chuan, Hu, Chia-Cheng, Jia, Han-Dong, Pan, Jeng-Shyang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9140713/
https://www.ncbi.nlm.nih.gov/pubmed/35626541
http://dx.doi.org/10.3390/e24050656
_version_ 1784715165557587968
author Wang, Ting-Ting
Chu, Shu-Chuan
Hu, Chia-Cheng
Jia, Han-Dong
Pan, Jeng-Shyang
author_facet Wang, Ting-Ting
Chu, Shu-Chuan
Hu, Chia-Cheng
Jia, Han-Dong
Pan, Jeng-Shyang
author_sort Wang, Ting-Ting
collection PubMed
description Manually designing a convolutional neural network (CNN) is an important deep learning method for solving the problem of image classification. However, most of the existing CNN structure designs consume a significant amount of time and computing resources. Over the years, the demand for neural architecture search (NAS) methods has been on the rise. Therefore, we propose a novel deep architecture generation model based on Aquila optimization (AO) and a genetic algorithm (GA). The main contributions of this paper are as follows: Firstly, a new encoding strategy representing the CNN coding structure is proposed, so that the evolutionary computing algorithm can be combined with CNN. Secondly, a new mechanism for updating location is proposed, which incorporates three typical operators from GA cleverly into the model we have designed so that the model can find the optimal solution in the limited search space. Thirdly, the proposed method can deal with the variable-length CNN structure by adding skip connections. Fourthly, combining traditional CNN layers and residual blocks and introducing a grouping strategy provides greater possibilities for searching for the optimal CNN structure. Additionally, we use two notable datasets, consisting of the MNIST and CIFAR-10 datasets for model evaluation. The experimental results show that our proposed model has good results in terms of search accuracy and time.
format Online
Article
Text
id pubmed-9140713
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91407132022-05-28 Efficient Network Architecture Search Using Hybrid Optimizer Wang, Ting-Ting Chu, Shu-Chuan Hu, Chia-Cheng Jia, Han-Dong Pan, Jeng-Shyang Entropy (Basel) Article Manually designing a convolutional neural network (CNN) is an important deep learning method for solving the problem of image classification. However, most of the existing CNN structure designs consume a significant amount of time and computing resources. Over the years, the demand for neural architecture search (NAS) methods has been on the rise. Therefore, we propose a novel deep architecture generation model based on Aquila optimization (AO) and a genetic algorithm (GA). The main contributions of this paper are as follows: Firstly, a new encoding strategy representing the CNN coding structure is proposed, so that the evolutionary computing algorithm can be combined with CNN. Secondly, a new mechanism for updating location is proposed, which incorporates three typical operators from GA cleverly into the model we have designed so that the model can find the optimal solution in the limited search space. Thirdly, the proposed method can deal with the variable-length CNN structure by adding skip connections. Fourthly, combining traditional CNN layers and residual blocks and introducing a grouping strategy provides greater possibilities for searching for the optimal CNN structure. Additionally, we use two notable datasets, consisting of the MNIST and CIFAR-10 datasets for model evaluation. The experimental results show that our proposed model has good results in terms of search accuracy and time. MDPI 2022-05-06 /pmc/articles/PMC9140713/ /pubmed/35626541 http://dx.doi.org/10.3390/e24050656 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Ting-Ting
Chu, Shu-Chuan
Hu, Chia-Cheng
Jia, Han-Dong
Pan, Jeng-Shyang
Efficient Network Architecture Search Using Hybrid Optimizer
title Efficient Network Architecture Search Using Hybrid Optimizer
title_full Efficient Network Architecture Search Using Hybrid Optimizer
title_fullStr Efficient Network Architecture Search Using Hybrid Optimizer
title_full_unstemmed Efficient Network Architecture Search Using Hybrid Optimizer
title_short Efficient Network Architecture Search Using Hybrid Optimizer
title_sort efficient network architecture search using hybrid optimizer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9140713/
https://www.ncbi.nlm.nih.gov/pubmed/35626541
http://dx.doi.org/10.3390/e24050656
work_keys_str_mv AT wangtingting efficientnetworkarchitecturesearchusinghybridoptimizer
AT chushuchuan efficientnetworkarchitecturesearchusinghybridoptimizer
AT huchiacheng efficientnetworkarchitecturesearchusinghybridoptimizer
AT jiahandong efficientnetworkarchitecturesearchusinghybridoptimizer
AT panjengshyang efficientnetworkarchitecturesearchusinghybridoptimizer