Cargando…

An Adaptive Deep Learning Optimization Method Based on Radius of Curvature

An adaptive clamping method (SGD-MS) based on the radius of curvature is designed to alleviate the local optimal oscillation problem in deep neural network, which combines the radius of curvature of the objective function and the gradient descent of the optimizer. The radius of curvature is consider...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jiahui, Yang, Xinhao, Zhang, Ke, Wen, Chenrui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8598332/
https://www.ncbi.nlm.nih.gov/pubmed/34804152
http://dx.doi.org/10.1155/2021/9882068
_version_ 1784600801422868480
author Zhang, Jiahui
Yang, Xinhao
Zhang, Ke
Wen, Chenrui
author_facet Zhang, Jiahui
Yang, Xinhao
Zhang, Ke
Wen, Chenrui
author_sort Zhang, Jiahui
collection PubMed
description An adaptive clamping method (SGD-MS) based on the radius of curvature is designed to alleviate the local optimal oscillation problem in deep neural network, which combines the radius of curvature of the objective function and the gradient descent of the optimizer. The radius of curvature is considered as the threshold to separate the momentum term or the future gradient moving average term adaptively. In addition, on this basis, we propose an accelerated version (SGD-MA), which further improves the convergence speed by using the method of aggregated momentum. Experimental results on several datasets show that the proposed methods effectively alleviate the local optimal oscillation problem and greatly improve the convergence speed and accuracy. A novel parameter updating algorithm is also provided in this paper for deep neural network.
format Online
Article
Text
id pubmed-8598332
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-85983322021-11-18 An Adaptive Deep Learning Optimization Method Based on Radius of Curvature Zhang, Jiahui Yang, Xinhao Zhang, Ke Wen, Chenrui Comput Intell Neurosci Research Article An adaptive clamping method (SGD-MS) based on the radius of curvature is designed to alleviate the local optimal oscillation problem in deep neural network, which combines the radius of curvature of the objective function and the gradient descent of the optimizer. The radius of curvature is considered as the threshold to separate the momentum term or the future gradient moving average term adaptively. In addition, on this basis, we propose an accelerated version (SGD-MA), which further improves the convergence speed by using the method of aggregated momentum. Experimental results on several datasets show that the proposed methods effectively alleviate the local optimal oscillation problem and greatly improve the convergence speed and accuracy. A novel parameter updating algorithm is also provided in this paper for deep neural network. Hindawi 2021-11-10 /pmc/articles/PMC8598332/ /pubmed/34804152 http://dx.doi.org/10.1155/2021/9882068 Text en Copyright © 2021 Jiahui Zhang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Jiahui
Yang, Xinhao
Zhang, Ke
Wen, Chenrui
An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
title An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
title_full An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
title_fullStr An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
title_full_unstemmed An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
title_short An Adaptive Deep Learning Optimization Method Based on Radius of Curvature
title_sort adaptive deep learning optimization method based on radius of curvature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8598332/
https://www.ncbi.nlm.nih.gov/pubmed/34804152
http://dx.doi.org/10.1155/2021/9882068
work_keys_str_mv AT zhangjiahui anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT yangxinhao anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT zhangke anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT wenchenrui anadaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT zhangjiahui adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT yangxinhao adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT zhangke adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature
AT wenchenrui adaptivedeeplearningoptimizationmethodbasedonradiusofcurvature