Cargando…

An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review

BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakne...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Ying, Li, Shu, Jin, Yujing, Zhou, Lengxiao, Sun, Shaomei, Xu, Xiaoqian, Li, Shuqian, Yang, Hongxi, Zhang, Qing, Wang, Yaogang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837707/
https://www.ncbi.nlm.nih.gov/pubmed/36426919
http://dx.doi.org/10.2196/35750
_version_ 1784869136960061440
author Gao, Ying
Li, Shu
Jin, Yujing
Zhou, Lengxiao
Sun, Shaomei
Xu, Xiaoqian
Li, Shuqian
Yang, Hongxi
Zhang, Qing
Wang, Yaogang
author_facet Gao, Ying
Li, Shu
Jin, Yujing
Zhou, Lengxiao
Sun, Shaomei
Xu, Xiaoqian
Li, Shuqian
Yang, Hongxi
Zhang, Qing
Wang, Yaogang
author_sort Gao, Ying
collection PubMed
description BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. OBJECTIVE: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. METHODS: We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. RESULTS: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I(2)=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; P(heterogeneity)=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. CONCLUSIONS: Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory.
format Online
Article
Text
id pubmed-9837707
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-98377072023-01-14 An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review Gao, Ying Li, Shu Jin, Yujing Zhou, Lengxiao Sun, Shaomei Xu, Xiaoqian Li, Shuqian Yang, Hongxi Zhang, Qing Wang, Yaogang JMIR Public Health Surveill Review BACKGROUND: Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. OBJECTIVE: The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. METHODS: We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. RESULTS: A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I(2)=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; P(heterogeneity)=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. CONCLUSIONS: Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory. JMIR Publications 2022-12-29 /pmc/articles/PMC9837707/ /pubmed/36426919 http://dx.doi.org/10.2196/35750 Text en ©Ying Gao, Shu Li, Yujing Jin, Lengxiao Zhou, Shaomei Sun, Xiaoqian Xu, Shuqian Li, Hongxi Yang, Qing Zhang, Yaogang Wang. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 29.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle Review
Gao, Ying
Li, Shu
Jin, Yujing
Zhou, Lengxiao
Sun, Shaomei
Xu, Xiaoqian
Li, Shuqian
Yang, Hongxi
Zhang, Qing
Wang, Yaogang
An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_full An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_fullStr An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_full_unstemmed An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_short An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review
title_sort assessment of the predictive performance of current machine learning–based breast cancer risk prediction models: systematic review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9837707/
https://www.ncbi.nlm.nih.gov/pubmed/36426919
http://dx.doi.org/10.2196/35750
work_keys_str_mv AT gaoying anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT lishu anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT jinyujing anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT zhoulengxiao anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT sunshaomei anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT xuxiaoqian anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT lishuqian anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT yanghongxi anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT zhangqing anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT wangyaogang anassessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT gaoying assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT lishu assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT jinyujing assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT zhoulengxiao assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT sunshaomei assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT xuxiaoqian assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT lishuqian assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT yanghongxi assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT zhangqing assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview
AT wangyaogang assessmentofthepredictiveperformanceofcurrentmachinelearningbasedbreastcancerriskpredictionmodelssystematicreview