Cargando…
Efficient Network Architecture Search Using Hybrid Optimizer
Manually designing a convolutional neural network (CNN) is an important deep learning method for solving the problem of image classification. However, most of the existing CNN structure designs consume a significant amount of time and computing resources. Over the years, the demand for neural archit...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9140713/ https://www.ncbi.nlm.nih.gov/pubmed/35626541 http://dx.doi.org/10.3390/e24050656 |
_version_ | 1784715165557587968 |
---|---|
author | Wang, Ting-Ting Chu, Shu-Chuan Hu, Chia-Cheng Jia, Han-Dong Pan, Jeng-Shyang |
author_facet | Wang, Ting-Ting Chu, Shu-Chuan Hu, Chia-Cheng Jia, Han-Dong Pan, Jeng-Shyang |
author_sort | Wang, Ting-Ting |
collection | PubMed |
description | Manually designing a convolutional neural network (CNN) is an important deep learning method for solving the problem of image classification. However, most of the existing CNN structure designs consume a significant amount of time and computing resources. Over the years, the demand for neural architecture search (NAS) methods has been on the rise. Therefore, we propose a novel deep architecture generation model based on Aquila optimization (AO) and a genetic algorithm (GA). The main contributions of this paper are as follows: Firstly, a new encoding strategy representing the CNN coding structure is proposed, so that the evolutionary computing algorithm can be combined with CNN. Secondly, a new mechanism for updating location is proposed, which incorporates three typical operators from GA cleverly into the model we have designed so that the model can find the optimal solution in the limited search space. Thirdly, the proposed method can deal with the variable-length CNN structure by adding skip connections. Fourthly, combining traditional CNN layers and residual blocks and introducing a grouping strategy provides greater possibilities for searching for the optimal CNN structure. Additionally, we use two notable datasets, consisting of the MNIST and CIFAR-10 datasets for model evaluation. The experimental results show that our proposed model has good results in terms of search accuracy and time. |
format | Online Article Text |
id | pubmed-9140713 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91407132022-05-28 Efficient Network Architecture Search Using Hybrid Optimizer Wang, Ting-Ting Chu, Shu-Chuan Hu, Chia-Cheng Jia, Han-Dong Pan, Jeng-Shyang Entropy (Basel) Article Manually designing a convolutional neural network (CNN) is an important deep learning method for solving the problem of image classification. However, most of the existing CNN structure designs consume a significant amount of time and computing resources. Over the years, the demand for neural architecture search (NAS) methods has been on the rise. Therefore, we propose a novel deep architecture generation model based on Aquila optimization (AO) and a genetic algorithm (GA). The main contributions of this paper are as follows: Firstly, a new encoding strategy representing the CNN coding structure is proposed, so that the evolutionary computing algorithm can be combined with CNN. Secondly, a new mechanism for updating location is proposed, which incorporates three typical operators from GA cleverly into the model we have designed so that the model can find the optimal solution in the limited search space. Thirdly, the proposed method can deal with the variable-length CNN structure by adding skip connections. Fourthly, combining traditional CNN layers and residual blocks and introducing a grouping strategy provides greater possibilities for searching for the optimal CNN structure. Additionally, we use two notable datasets, consisting of the MNIST and CIFAR-10 datasets for model evaluation. The experimental results show that our proposed model has good results in terms of search accuracy and time. MDPI 2022-05-06 /pmc/articles/PMC9140713/ /pubmed/35626541 http://dx.doi.org/10.3390/e24050656 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Ting-Ting Chu, Shu-Chuan Hu, Chia-Cheng Jia, Han-Dong Pan, Jeng-Shyang Efficient Network Architecture Search Using Hybrid Optimizer |
title | Efficient Network Architecture Search Using Hybrid Optimizer |
title_full | Efficient Network Architecture Search Using Hybrid Optimizer |
title_fullStr | Efficient Network Architecture Search Using Hybrid Optimizer |
title_full_unstemmed | Efficient Network Architecture Search Using Hybrid Optimizer |
title_short | Efficient Network Architecture Search Using Hybrid Optimizer |
title_sort | efficient network architecture search using hybrid optimizer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9140713/ https://www.ncbi.nlm.nih.gov/pubmed/35626541 http://dx.doi.org/10.3390/e24050656 |
work_keys_str_mv | AT wangtingting efficientnetworkarchitecturesearchusinghybridoptimizer AT chushuchuan efficientnetworkarchitecturesearchusinghybridoptimizer AT huchiacheng efficientnetworkarchitecturesearchusinghybridoptimizer AT jiahandong efficientnetworkarchitecturesearchusinghybridoptimizer AT panjengshyang efficientnetworkarchitecturesearchusinghybridoptimizer |