Cargando…
Selecting the best optimizers for deep learning–based medical image segmentation
PURPOSE: The goal of this work is to explore the best optimizers for deep learning in the context of medical image segmentation and to provide guidance on how to design segmentation networks with effective optimization strategies. APPROACH: Most successful deep learning networks are trained using tw...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551178/ https://www.ncbi.nlm.nih.gov/pubmed/37810757 http://dx.doi.org/10.3389/fradi.2023.1175473 |
_version_ | 1785115705613484032 |
---|---|
author | Mortazi, Aliasghar Cicek, Vedat Keles, Elif Bagci, Ulas |
author_facet | Mortazi, Aliasghar Cicek, Vedat Keles, Elif Bagci, Ulas |
author_sort | Mortazi, Aliasghar |
collection | PubMed |
description | PURPOSE: The goal of this work is to explore the best optimizers for deep learning in the context of medical image segmentation and to provide guidance on how to design segmentation networks with effective optimization strategies. APPROACH: Most successful deep learning networks are trained using two types of stochastic gradient descent (SGD) algorithms: adaptive learning and accelerated schemes. Adaptive learning helps with fast convergence by starting with a larger learning rate (LR) and gradually decreasing it. Momentum optimizers are particularly effective at quickly optimizing neural networks within the accelerated schemes category. By revealing the potential interplay between these two types of algorithms [LR and momentum optimizers or momentum rate (MR) in short], in this article, we explore the two variants of SGD algorithms in a single setting. We suggest using cyclic learning as the base optimizer and integrating optimal values of learning rate and momentum rate. The new optimization function proposed in this work is based on the Nesterov accelerated gradient optimizer, which is more efficient computationally and has better generalization capabilities compared to other adaptive optimizers. RESULTS: We investigated the relationship of LR and MR under an important problem of medical image segmentation of cardiac structures from MRI and CT scans. We conducted experiments using the cardiac imaging dataset from the ACDC challenge of MICCAI 2017, and four different architectures were shown to be successful for cardiac image segmentation problems. Our comprehensive evaluations demonstrated that the proposed optimizer achieved better results (over a 2% improvement in the dice metric) than other optimizers in the deep learning literature with similar or lower computational cost in both single and multi-object segmentation settings. CONCLUSIONS: We hypothesized that the combination of accelerated and adaptive optimization methods can have a drastic effect in medical image segmentation performances. To this end, we proposed a new cyclic optimization method (Cyclic Learning/Momentum Rate) to address the efficiency and accuracy problems in deep learning–based medical image segmentation. The proposed strategy yielded better generalization in comparison to adaptive optimizers. |
format | Online Article Text |
id | pubmed-10551178 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105511782023-10-06 Selecting the best optimizers for deep learning–based medical image segmentation Mortazi, Aliasghar Cicek, Vedat Keles, Elif Bagci, Ulas Front Radiol Radiology PURPOSE: The goal of this work is to explore the best optimizers for deep learning in the context of medical image segmentation and to provide guidance on how to design segmentation networks with effective optimization strategies. APPROACH: Most successful deep learning networks are trained using two types of stochastic gradient descent (SGD) algorithms: adaptive learning and accelerated schemes. Adaptive learning helps with fast convergence by starting with a larger learning rate (LR) and gradually decreasing it. Momentum optimizers are particularly effective at quickly optimizing neural networks within the accelerated schemes category. By revealing the potential interplay between these two types of algorithms [LR and momentum optimizers or momentum rate (MR) in short], in this article, we explore the two variants of SGD algorithms in a single setting. We suggest using cyclic learning as the base optimizer and integrating optimal values of learning rate and momentum rate. The new optimization function proposed in this work is based on the Nesterov accelerated gradient optimizer, which is more efficient computationally and has better generalization capabilities compared to other adaptive optimizers. RESULTS: We investigated the relationship of LR and MR under an important problem of medical image segmentation of cardiac structures from MRI and CT scans. We conducted experiments using the cardiac imaging dataset from the ACDC challenge of MICCAI 2017, and four different architectures were shown to be successful for cardiac image segmentation problems. Our comprehensive evaluations demonstrated that the proposed optimizer achieved better results (over a 2% improvement in the dice metric) than other optimizers in the deep learning literature with similar or lower computational cost in both single and multi-object segmentation settings. CONCLUSIONS: We hypothesized that the combination of accelerated and adaptive optimization methods can have a drastic effect in medical image segmentation performances. To this end, we proposed a new cyclic optimization method (Cyclic Learning/Momentum Rate) to address the efficiency and accuracy problems in deep learning–based medical image segmentation. The proposed strategy yielded better generalization in comparison to adaptive optimizers. Frontiers Media S.A. 2023-09-21 /pmc/articles/PMC10551178/ /pubmed/37810757 http://dx.doi.org/10.3389/fradi.2023.1175473 Text en © 2023 Mortazi, Cicek, Keles and Bagci. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Radiology Mortazi, Aliasghar Cicek, Vedat Keles, Elif Bagci, Ulas Selecting the best optimizers for deep learning–based medical image segmentation |
title | Selecting the best optimizers for deep learning–based medical image segmentation |
title_full | Selecting the best optimizers for deep learning–based medical image segmentation |
title_fullStr | Selecting the best optimizers for deep learning–based medical image segmentation |
title_full_unstemmed | Selecting the best optimizers for deep learning–based medical image segmentation |
title_short | Selecting the best optimizers for deep learning–based medical image segmentation |
title_sort | selecting the best optimizers for deep learning–based medical image segmentation |
topic | Radiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551178/ https://www.ncbi.nlm.nih.gov/pubmed/37810757 http://dx.doi.org/10.3389/fradi.2023.1175473 |
work_keys_str_mv | AT mortazialiasghar selectingthebestoptimizersfordeeplearningbasedmedicalimagesegmentation AT cicekvedat selectingthebestoptimizersfordeeplearningbasedmedicalimagesegmentation AT keleselif selectingthebestoptimizersfordeeplearningbasedmedicalimagesegmentation AT bagciulas selectingthebestoptimizersfordeeplearningbasedmedicalimagesegmentation |