Cargando…

Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto

Convolutional Neural Network (CNN) models have been commonly used primarily in image recognition tasks in the deep learning area. Finding the right architecture needs a lot of hand-tune experiments which are time-consuming. In this paper, we exploit an AutoML framework that adds to the exploration o...

Descripción completa

Detalles Bibliográficos
Autores principales: Chantrapornchai, Chantana, Kajkamhaeng, Supasit, Romphet, Phattharaphon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313661/
https://www.ncbi.nlm.nih.gov/pubmed/37391458
http://dx.doi.org/10.1038/s41598-023-37682-0
_version_ 1785067164130082816
author Chantrapornchai, Chantana
Kajkamhaeng, Supasit
Romphet, Phattharaphon
author_facet Chantrapornchai, Chantana
Kajkamhaeng, Supasit
Romphet, Phattharaphon
author_sort Chantrapornchai, Chantana
collection PubMed
description Convolutional Neural Network (CNN) models have been commonly used primarily in image recognition tasks in the deep learning area. Finding the right architecture needs a lot of hand-tune experiments which are time-consuming. In this paper, we exploit an AutoML framework that adds to the exploration of the micro-architecture block and the multi-input option. The proposed adaption has been applied to SqueezeNet with SE blocks combined with the residual block combinations. The experiments assume three search strategies: Random, Hyperband, and Bayesian algorithms. Such combinations can lead to solutions with superior accuracy while the model size can be monitored. We demonstrate the application of the approach against benchmarks: CIFAR-10 and Tsinghua Facial Expression datasets. The searches allow the designer to find the architectures with better accuracy than the traditional architectures without hand-tune efforts. For example, CIFAR-10, leads to the SqueezeNet architecture using only 4 fire modules with 59% accuracy. When exploring SE block insertion, the model with good insertion points can lead to an accuracy of 78% while the traditional SqueezeNet can achieve an accuracy of around 50%. For other tasks, such as facial expression recognition, the proposed approach can lead up to an accuracy of 71% with the proper insertion of SE blocks, the appropriate number of fire modules, and adequate input merging, while the traditional model can achieve the accuracy under 20%.
format Online
Article
Text
id pubmed-10313661
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103136612023-07-02 Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto Chantrapornchai, Chantana Kajkamhaeng, Supasit Romphet, Phattharaphon Sci Rep Article Convolutional Neural Network (CNN) models have been commonly used primarily in image recognition tasks in the deep learning area. Finding the right architecture needs a lot of hand-tune experiments which are time-consuming. In this paper, we exploit an AutoML framework that adds to the exploration of the micro-architecture block and the multi-input option. The proposed adaption has been applied to SqueezeNet with SE blocks combined with the residual block combinations. The experiments assume three search strategies: Random, Hyperband, and Bayesian algorithms. Such combinations can lead to solutions with superior accuracy while the model size can be monitored. We demonstrate the application of the approach against benchmarks: CIFAR-10 and Tsinghua Facial Expression datasets. The searches allow the designer to find the architectures with better accuracy than the traditional architectures without hand-tune efforts. For example, CIFAR-10, leads to the SqueezeNet architecture using only 4 fire modules with 59% accuracy. When exploring SE block insertion, the model with good insertion points can lead to an accuracy of 78% while the traditional SqueezeNet can achieve an accuracy of around 50%. For other tasks, such as facial expression recognition, the proposed approach can lead up to an accuracy of 71% with the proper insertion of SE blocks, the appropriate number of fire modules, and adequate input merging, while the traditional model can achieve the accuracy under 20%. Nature Publishing Group UK 2023-06-30 /pmc/articles/PMC10313661/ /pubmed/37391458 http://dx.doi.org/10.1038/s41598-023-37682-0 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Chantrapornchai, Chantana
Kajkamhaeng, Supasit
Romphet, Phattharaphon
Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto
title Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto
title_full Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto
title_fullStr Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto
title_full_unstemmed Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto
title_short Micro-architecture design exploration template for AutoML case study on SqueezeSEMAuto
title_sort micro-architecture design exploration template for automl case study on squeezesemauto
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10313661/
https://www.ncbi.nlm.nih.gov/pubmed/37391458
http://dx.doi.org/10.1038/s41598-023-37682-0
work_keys_str_mv AT chantrapornchaichantana microarchitecturedesignexplorationtemplateforautomlcasestudyonsqueezesemauto
AT kajkamhaengsupasit microarchitecturedesignexplorationtemplateforautomlcasestudyonsqueezesemauto
AT romphetphattharaphon microarchitecturedesignexplorationtemplateforautomlcasestudyonsqueezesemauto