Cargando…
Deep Q-learning to globally optimize a k-D parameter search for medical imaging
BACKGROUND: Estimation of the global optima of multiple model parameters is valuable for precisely extracting parameters that characterize a physical environment. This is especially useful for imaging purposes, to form reliable, meaningful physical images with good reproducibility. However, it is ch...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10423342/ https://www.ncbi.nlm.nih.gov/pubmed/37581036 http://dx.doi.org/10.21037/qims-22-1147 |
_version_ | 1785089427320602624 |
---|---|
author | Zhang, Hongmei Liang, Songshi Matkovic, Luke A. Momin, Shadab Wang, Kai Yang, Xiaofeng Insana, Michael F. |
author_facet | Zhang, Hongmei Liang, Songshi Matkovic, Luke A. Momin, Shadab Wang, Kai Yang, Xiaofeng Insana, Michael F. |
author_sort | Zhang, Hongmei |
collection | PubMed |
description | BACKGROUND: Estimation of the global optima of multiple model parameters is valuable for precisely extracting parameters that characterize a physical environment. This is especially useful for imaging purposes, to form reliable, meaningful physical images with good reproducibility. However, it is challenging to avoid different local minima when the objective function is nonconvex. The problem of global searching of multiple parameters was formulated to be a k-D move in the parameter space and the parameter updating scheme was converted to be a state-action decision-making problem. METHODS: We proposed a novel Deep Q-learning of Model Parameters (DQMP) method for global optimization which updated the parameter configurations through actions that maximized the Q-value and employed a Deep Reward Network (DRN) designed to learn global reward values from both visible fitting errors and hidden parameter errors. The DRN was constructed with Long Short-Term Memory (LSTM) layers followed by fully connected layers and a rectified linear unit (ReLU) nonlinearity. The depth of the DRN depended on the number of parameters. Through DQMP, the k-D parameter search in each step resembled the decision-making of action selections from 3(k) configurations in a k-D board game. RESULTS: The DQMP method was evaluated by widely used general functions that can express a variety of experimental data and further validated on imaging applications. The convergence of the proposed DRN was evaluated, which showed that the loss values of six general functions all converged after 12 epochs. The parameters estimated by the DQMP method had relative errors of less than 4% for all cases, whereas the relative errors achieved by Q-learning (QL) and the Least Squares Method (LSM) were 17% and 21%, respectively. Furthermore, the imaging experiments demonstrated that the imaging of the parameters estimated by the proposed DQMP method were the closest to the ground truth simulation images when compared to other methods. CONCLUSIONS: The proposed DQMP method was able to achieve global optima, thus yielding accurate model parameter estimates. DQMP is promising for estimating multiple high-dimensional parameters and can be generalized to global optimization for many other complex nonconvex functions and imaging of physical parameters. |
format | Online Article Text |
id | pubmed-10423342 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-104233422023-08-14 Deep Q-learning to globally optimize a k-D parameter search for medical imaging Zhang, Hongmei Liang, Songshi Matkovic, Luke A. Momin, Shadab Wang, Kai Yang, Xiaofeng Insana, Michael F. Quant Imaging Med Surg Original Article BACKGROUND: Estimation of the global optima of multiple model parameters is valuable for precisely extracting parameters that characterize a physical environment. This is especially useful for imaging purposes, to form reliable, meaningful physical images with good reproducibility. However, it is challenging to avoid different local minima when the objective function is nonconvex. The problem of global searching of multiple parameters was formulated to be a k-D move in the parameter space and the parameter updating scheme was converted to be a state-action decision-making problem. METHODS: We proposed a novel Deep Q-learning of Model Parameters (DQMP) method for global optimization which updated the parameter configurations through actions that maximized the Q-value and employed a Deep Reward Network (DRN) designed to learn global reward values from both visible fitting errors and hidden parameter errors. The DRN was constructed with Long Short-Term Memory (LSTM) layers followed by fully connected layers and a rectified linear unit (ReLU) nonlinearity. The depth of the DRN depended on the number of parameters. Through DQMP, the k-D parameter search in each step resembled the decision-making of action selections from 3(k) configurations in a k-D board game. RESULTS: The DQMP method was evaluated by widely used general functions that can express a variety of experimental data and further validated on imaging applications. The convergence of the proposed DRN was evaluated, which showed that the loss values of six general functions all converged after 12 epochs. The parameters estimated by the DQMP method had relative errors of less than 4% for all cases, whereas the relative errors achieved by Q-learning (QL) and the Least Squares Method (LSM) were 17% and 21%, respectively. Furthermore, the imaging experiments demonstrated that the imaging of the parameters estimated by the proposed DQMP method were the closest to the ground truth simulation images when compared to other methods. CONCLUSIONS: The proposed DQMP method was able to achieve global optima, thus yielding accurate model parameter estimates. DQMP is promising for estimating multiple high-dimensional parameters and can be generalized to global optimization for many other complex nonconvex functions and imaging of physical parameters. AME Publishing Company 2023-06-27 2023-08-01 /pmc/articles/PMC10423342/ /pubmed/37581036 http://dx.doi.org/10.21037/qims-22-1147 Text en 2023 Quantitative Imaging in Medicine and Surgery. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Zhang, Hongmei Liang, Songshi Matkovic, Luke A. Momin, Shadab Wang, Kai Yang, Xiaofeng Insana, Michael F. Deep Q-learning to globally optimize a k-D parameter search for medical imaging |
title | Deep Q-learning to globally optimize a k-D parameter search for medical imaging |
title_full | Deep Q-learning to globally optimize a k-D parameter search for medical imaging |
title_fullStr | Deep Q-learning to globally optimize a k-D parameter search for medical imaging |
title_full_unstemmed | Deep Q-learning to globally optimize a k-D parameter search for medical imaging |
title_short | Deep Q-learning to globally optimize a k-D parameter search for medical imaging |
title_sort | deep q-learning to globally optimize a k-d parameter search for medical imaging |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10423342/ https://www.ncbi.nlm.nih.gov/pubmed/37581036 http://dx.doi.org/10.21037/qims-22-1147 |
work_keys_str_mv | AT zhanghongmei deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging AT liangsongshi deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging AT matkoviclukea deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging AT mominshadab deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging AT wangkai deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging AT yangxiaofeng deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging AT insanamichaelf deepqlearningtogloballyoptimizeakdparametersearchformedicalimaging |