Cargando…
An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
An adaptive clamping method (SGD-MS) based on the radius of curvature is designed to alleviate the local optimal oscillation problem in deep neural network, which combines the radius of curvature of the objective function and the gradient descent of the optimizer. The radius of curvature is consider...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8598332/ https://www.ncbi.nlm.nih.gov/pubmed/34804152 http://dx.doi.org/10.1155/2021/9882068 |
_version_ | 1784600801422868480 |
---|---|
author | Zhang, Jiahui Yang, Xinhao Zhang, Ke Wen, Chenrui |
author_facet | Zhang, Jiahui Yang, Xinhao Zhang, Ke Wen, Chenrui |
author_sort | Zhang, Jiahui |
collection | PubMed |
description | An adaptive clamping method (SGD-MS) based on the radius of curvature is designed to alleviate the local optimal oscillation problem in deep neural network, which combines the radius of curvature of the objective function and the gradient descent of the optimizer. The radius of curvature is considered as the threshold to separate the momentum term or the future gradient moving average term adaptively. In addition, on this basis, we propose an accelerated version (SGD-MA), which further improves the convergence speed by using the method of aggregated momentum. Experimental results on several datasets show that the proposed methods effectively alleviate the local optimal oscillation problem and greatly improve the convergence speed and accuracy. A novel parameter updating algorithm is also provided in this paper for deep neural network. |
format | Online Article Text |
id | pubmed-8598332 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-85983322021-11-18 An Adaptive Deep Learning Optimization Method Based on Radius of Curvature Zhang, Jiahui Yang, Xinhao Zhang, Ke Wen, Chenrui Comput Intell Neurosci Research Article An adaptive clamping method (SGD-MS) based on the radius of curvature is designed to alleviate the local optimal oscillation problem in deep neural network, which combines the radius of curvature of the objective function and the gradient descent of the optimizer. The radius of curvature is considered as the threshold to separate the momentum term or the future gradient moving average term adaptively. In addition, on this basis, we propose an accelerated version (SGD-MA), which further improves the convergence speed by using the method of aggregated momentum. Experimental results on several datasets show that the proposed methods effectively alleviate the local optimal oscillation problem and greatly improve the convergence speed and accuracy. A novel parameter updating algorithm is also provided in this paper for deep neural network. Hindawi 2021-11-10 /pmc/articles/PMC8598332/ /pubmed/34804152 http://dx.doi.org/10.1155/2021/9882068 Text en Copyright © 2021 Jiahui Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhang, Jiahui Yang, Xinhao Zhang, Ke Wen, Chenrui An Adaptive Deep Learning Optimization Method Based on Radius of Curvature |
title | An Adaptive Deep Learning Optimization Method Based on Radius of Curvature |
title_full | An Adaptive Deep Learning Optimization Method Based on Radius of Curvature |
title_fullStr | An Adaptive Deep Learning Optimization Method Based on Radius of Curvature |
title_full_unstemmed | An Adaptive Deep Learning Optimization Method Based on Radius of Curvature |
title_short | An Adaptive Deep Learning Optimization Method Based on Radius of Curvature |
title_sort | adaptive deep learning optimization method based on radius of curvature |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8598332/ https://www.ncbi.nlm.nih.gov/pubmed/34804152 http://dx.doi.org/10.1155/2021/9882068 |
work_keys_str_mv | AT zhangjiahui anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT yangxinhao anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT zhangke anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT wenchenrui anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT zhangjiahui adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT yangxinhao adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT zhangke adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature AT wenchenrui adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature |