Cargando…

Deep Q-learning to globally optimize a k-D parameter search for medical imaging

BACKGROUND: Estimation of the global optima of multiple model parameters is valuable for precisely extracting parameters that characterize a physical environment. This is especially useful for imaging purposes, to form reliable, meaningful physical images with good reproducibility. However, it is ch...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hongmei, Liang, Songshi, Matkovic, Luke A., Momin, Shadab, Wang, Kai, Yang, Xiaofeng, Insana, Michael F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10423342/
https://www.ncbi.nlm.nih.gov/pubmed/37581036
http://dx.doi.org/10.21037/qims-22-1147
_version_ 1785089427320602624
author Zhang, Hongmei
Liang, Songshi
Matkovic, Luke A.
Momin, Shadab
Wang, Kai
Yang, Xiaofeng
Insana, Michael F.
author_facet Zhang, Hongmei
Liang, Songshi
Matkovic, Luke A.
Momin, Shadab
Wang, Kai
Yang, Xiaofeng
Insana, Michael F.
author_sort Zhang, Hongmei
collection PubMed
description BACKGROUND: Estimation of the global optima of multiple model parameters is valuable for precisely extracting parameters that characterize a physical environment. This is especially useful for imaging purposes, to form reliable, meaningful physical images with good reproducibility. However, it is challenging to avoid different local minima when the objective function is nonconvex. The problem of global searching of multiple parameters was formulated to be a k-D move in the parameter space and the parameter updating scheme was converted to be a state-action decision-making problem. METHODS: We proposed a novel Deep Q-learning of Model Parameters (DQMP) method for global optimization which updated the parameter configurations through actions that maximized the Q-value and employed a Deep Reward Network (DRN) designed to learn global reward values from both visible fitting errors and hidden parameter errors. The DRN was constructed with Long Short-Term Memory (LSTM) layers followed by fully connected layers and a rectified linear unit (ReLU) nonlinearity. The depth of the DRN depended on the number of parameters. Through DQMP, the k-D parameter search in each step resembled the decision-making of action selections from 3(k) configurations in a k-D board game. RESULTS: The DQMP method was evaluated by widely used general functions that can express a variety of experimental data and further validated on imaging applications. The convergence of the proposed DRN was evaluated, which showed that the loss values of six general functions all converged after 12 epochs. The parameters estimated by the DQMP method had relative errors of less than 4% for all cases, whereas the relative errors achieved by Q-learning (QL) and the Least Squares Method (LSM) were 17% and 21%, respectively. Furthermore, the imaging experiments demonstrated that the imaging of the parameters estimated by the proposed DQMP method were the closest to the ground truth simulation images when compared to other methods. CONCLUSIONS: The proposed DQMP method was able to achieve global optima, thus yielding accurate model parameter estimates. DQMP is promising for estimating multiple high-dimensional parameters and can be generalized to global optimization for many other complex nonconvex functions and imaging of physical parameters.
format Online
Article
Text
id pubmed-10423342
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-104233422023-08-14 Deep Q-learning to globally optimize a k-D parameter search for medical imaging Zhang, Hongmei Liang, Songshi Matkovic, Luke A. Momin, Shadab Wang, Kai Yang, Xiaofeng Insana, Michael F. Quant Imaging Med Surg Original Article BACKGROUND: Estimation of the global optima of multiple model parameters is valuable for precisely extracting parameters that characterize a physical environment. This is especially useful for imaging purposes, to form reliable, meaningful physical images with good reproducibility. However, it is challenging to avoid different local minima when the objective function is nonconvex. The problem of global searching of multiple parameters was formulated to be a k-D move in the parameter space and the parameter updating scheme was converted to be a state-action decision-making problem. METHODS: We proposed a novel Deep Q-learning of Model Parameters (DQMP) method for global optimization which updated the parameter configurations through actions that maximized the Q-value and employed a Deep Reward Network (DRN) designed to learn global reward values from both visible fitting errors and hidden parameter errors. The DRN was constructed with Long Short-Term Memory (LSTM) layers followed by fully connected layers and a rectified linear unit (ReLU) nonlinearity. The depth of the DRN depended on the number of parameters. Through DQMP, the k-D parameter search in each step resembled the decision-making of action selections from 3(k) configurations in a k-D board game. RESULTS: The DQMP method was evaluated by widely used general functions that can express a variety of experimental data and further validated on imaging applications. The convergence of the proposed DRN was evaluated, which showed that the loss values of six general functions all converged after 12 epochs. The parameters estimated by the DQMP method had relative errors of less than 4% for all cases, whereas the relative errors achieved by Q-learning (QL) and the Least Squares Method (LSM) were 17% and 21%, respectively. Furthermore, the imaging experiments demonstrated that the imaging of the parameters estimated by the proposed DQMP method were the closest to the ground truth simulation images when compared to other methods. CONCLUSIONS: The proposed DQMP method was able to achieve global optima, thus yielding accurate model parameter estimates. DQMP is promising for estimating multiple high-dimensional parameters and can be generalized to global optimization for many other complex nonconvex functions and imaging of physical parameters. AME Publishing Company 2023-06-27 2023-08-01 /pmc/articles/PMC10423342/ /pubmed/37581036 http://dx.doi.org/10.21037/qims-22-1147 Text en 2023 Quantitative Imaging in Medicine and Surgery. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Zhang, Hongmei
Liang, Songshi
Matkovic, Luke A.
Momin, Shadab
Wang, Kai
Yang, Xiaofeng
Insana, Michael F.
Deep Q-learning to globally optimize a k-D parameter search for medical imaging
title Deep Q-learning to globally optimize a k-D parameter search for medical imaging
title_full Deep Q-learning to globally optimize a k-D parameter search for medical imaging
title_fullStr Deep Q-learning to globally optimize a k-D parameter search for medical imaging
title_full_unstemmed Deep Q-learning to globally optimize a k-D parameter search for medical imaging
title_short Deep Q-learning to globally optimize a k-D parameter search for medical imaging
title_sort deep q-learning to globally optimize a k-d parameter search for medical imaging
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10423342/
https://www.ncbi.nlm.nih.gov/pubmed/37581036
http://dx.doi.org/10.21037/qims-22-1147
work_keys_str_mv AT zhanghongmei deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging
AT liangsongshi deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging
AT matkoviclukea deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging
AT mominshadab deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging
AT wangkai deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging
AT yangxiaofeng deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging
AT insanamichaelf deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging