Cargando…

Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines

Deep neural networks are widely used in the field of image processing for micromachines, such as in 3D shape detection in microelectronic high-speed dispensing and object detection in microrobots. It is already known that hyperparameters and their interactions impact neural network model performance...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Mingming, Yang, Jing, Li, Shaobo, Zhang, Ansi, Bai, Qiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8704841/
https://www.ncbi.nlm.nih.gov/pubmed/34945353
http://dx.doi.org/10.3390/mi12121504
_version_ 1784621802624909312
author Shen, Mingming
Yang, Jing
Li, Shaobo
Zhang, Ansi
Bai, Qiang
author_facet Shen, Mingming
Yang, Jing
Li, Shaobo
Zhang, Ansi
Bai, Qiang
author_sort Shen, Mingming
collection PubMed
description Deep neural networks are widely used in the field of image processing for micromachines, such as in 3D shape detection in microelectronic high-speed dispensing and object detection in microrobots. It is already known that hyperparameters and their interactions impact neural network model performance. Taking advantage of the mathematical correlations between hyperparameters and the corresponding deep learning model to adjust hyperparameters intelligently is the key to obtaining an optimal solution from a deep neural network model. Leveraging these correlations is also significant for unlocking the “black box” of deep learning by revealing the mechanism of its mathematical principle. However, there is no complete system for studying the combination of mathematical derivation and experimental verification methods to quantify the impacts of hyperparameters on the performances of deep learning models. Therefore, in this paper, the authors analyzed the mathematical relationships among four hyperparameters: the learning rate, batch size, dropout rate, and convolution kernel size. A generalized multiparameter mathematical correlation model was also established, which showed that the interaction between these hyperparameters played an important role in the neural network’s performance. Different experiments were verified by running convolutional neural network algorithms to validate the proposal on the MNIST dataset. Notably, this research can help establish a universal multiparameter mathematical correlation model to guide the deep learning parameter adjustment process.
format Online
Article
Text
id pubmed-8704841
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87048412021-12-25 Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines Shen, Mingming Yang, Jing Li, Shaobo Zhang, Ansi Bai, Qiang Micromachines (Basel) Article Deep neural networks are widely used in the field of image processing for micromachines, such as in 3D shape detection in microelectronic high-speed dispensing and object detection in microrobots. It is already known that hyperparameters and their interactions impact neural network model performance. Taking advantage of the mathematical correlations between hyperparameters and the corresponding deep learning model to adjust hyperparameters intelligently is the key to obtaining an optimal solution from a deep neural network model. Leveraging these correlations is also significant for unlocking the “black box” of deep learning by revealing the mechanism of its mathematical principle. However, there is no complete system for studying the combination of mathematical derivation and experimental verification methods to quantify the impacts of hyperparameters on the performances of deep learning models. Therefore, in this paper, the authors analyzed the mathematical relationships among four hyperparameters: the learning rate, batch size, dropout rate, and convolution kernel size. A generalized multiparameter mathematical correlation model was also established, which showed that the interaction between these hyperparameters played an important role in the neural network’s performance. Different experiments were verified by running convolutional neural network algorithms to validate the proposal on the MNIST dataset. Notably, this research can help establish a universal multiparameter mathematical correlation model to guide the deep learning parameter adjustment process. MDPI 2021-11-30 /pmc/articles/PMC8704841/ /pubmed/34945353 http://dx.doi.org/10.3390/mi12121504 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shen, Mingming
Yang, Jing
Li, Shaobo
Zhang, Ansi
Bai, Qiang
Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
title Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
title_full Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
title_fullStr Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
title_full_unstemmed Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
title_short Nonlinear Hyperparameter Optimization of a Neural Network in Image Processing for Micromachines
title_sort nonlinear hyperparameter optimization of a neural network in image processing for micromachines
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8704841/
https://www.ncbi.nlm.nih.gov/pubmed/34945353
http://dx.doi.org/10.3390/mi12121504
work_keys_str_mv AT shenmingming nonlinearhyperparameteroptimizationofaneuralnetworkinimageprocessingformicromachines
AT yangjing nonlinearhyperparameteroptimizationofaneuralnetworkinimageprocessingformicromachines
AT lishaobo nonlinearhyperparameteroptimizationofaneuralnetworkinimageprocessingformicromachines
AT zhangansi nonlinearhyperparameteroptimizationofaneuralnetworkinimageprocessingformicromachines
AT baiqiang nonlinearhyperparameteroptimizationofaneuralnetworkinimageprocessingformicromachines