Cargando…
An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakne...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837707/ https://www.ncbi.nlm.nih.gov/pubmed/36426919 http://dx.doi.org/10.2196/35750 |
_version_ | 1784869136960061440 |
---|---|
author | Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang |
author_facet | Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang |
author_sort | Gao, Ying |
collection | PubMed |
description | BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. OBJECTIVE: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. METHODS: We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. RESULTS: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I(2)=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; P(heterogeneity)=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. CONCLUSIONS: Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory. |
format | Online Article Text |
id | pubmed-9837707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-98377072023-01-14 An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang JMIR Public Health Surveill Review BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. OBJECTIVE: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. METHODS: We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. RESULTS: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I(2)=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; P(heterogeneity)=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. CONCLUSIONS: Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory. JMIR Publications 2022-12-29 /pmc/articles/PMC9837707/ /pubmed/36426919 http://dx.doi.org/10.2196/35750 Text en ©Ying Gao, Shu Li, Yujing Jin, Lengxiao Zhou, Shaomei Sun, Xiaoqian Xu, Shuqian Li, Hongxi Yang, Qing Zhang, Yaogang Wang. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 29.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Review Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review |
title | An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review |
title_full | An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review |
title_fullStr | An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review |
title_full_unstemmed | An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review |
title_short | An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review |
title_sort | assessment of the predictive performance of current machine learning–based breast cancer risk prediction models: systematic review |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837707/ https://www.ncbi.nlm.nih.gov/pubmed/36426919 http://dx.doi.org/10.2196/35750 |
work_keys_str_mv | AT gaoying anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishu anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT jinyujing anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhoulengxiao anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT sunshaomei anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT xuxiaoqian anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishuqian anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT yanghongxi anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhangqing anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT wangyaogang anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT gaoying assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishu assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT jinyujing assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhoulengxiao assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT sunshaomei assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT xuxiaoqian assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishuqian assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT yanghongxi assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhangqing assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT wangyaogang assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview |