Cargando…

An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review

BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakne...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gao, Ying, Li, Shu, Jin, Yujing, Zhou, Lengxiao, Sun, Shaomei, Xu, Xiaoqian, Li, Shuqian, Yang, Hongxi, Zhang, Qing, Wang, Yaogang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Review
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837707/ https://www.ncbi.nlm.nih.gov/pubmed/36426919 http://dx.doi.org/10.2196/35750

_version_	1784869136960061440
author	Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang
author_facet	Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang
author_sort	Gao, Ying
collection	PubMed
description	BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. OBJECTIVE: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. METHODS: We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. RESULTS: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I(2)=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; P(heterogeneity)=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. CONCLUSIONS: Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory.
format	Online Article Text
id	pubmed-9837707
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-98377072023-01-14 An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang JMIR Public Health Surveill Review BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. OBJECTIVE: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. METHODS: We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. RESULTS: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I(2)=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; P(heterogeneity)=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. CONCLUSIONS: Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory. JMIR Publications 2022-12-29 /pmc/articles/PMC9837707/ /pubmed/36426919 http://dx.doi.org/10.2196/35750 Text en ©Ying Gao, Shu Li, Yujing Jin, Lengxiao Zhou, Shaomei Sun, Xiaoqian Xu, Shuqian Li, Hongxi Yang, Qing Zhang, Yaogang Wang. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 29.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle	Review Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title	An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_full	An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_fullStr	An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_full_unstemmed	An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_short	An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_sort	assessment of the predictive performance of current machine learning–based breast cancer risk prediction models: systematic review
topic	Review
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837707/ https://www.ncbi.nlm.nih.gov/pubmed/36426919 http://dx.doi.org/10.2196/35750
work_keys_str_mv	AT gaoying anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishu anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT jinyujing anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhoulengxiao anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT sunshaomei anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT xuxiaoqian anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishuqian anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT yanghongxi anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhangqing anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT wangyaogang anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT gaoying assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishu assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT jinyujing assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhoulengxiao assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT sunshaomei assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT xuxiaoqian assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT lishuqian assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT yanghongxi assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT zhangqing assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview AT wangyaogang assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview

An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review

Ejemplares similares