Cargando…

Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models

BACKGROUND: Comprehensive breast cancer risk prediction models enable identifying and targeting women at high-risk, while reducing interventions in those at low-risk. Breast cancer risk prediction models used in clinical practice have low discriminatory accuracy (0.53–0.64). Machine learning (ML) of...

Descripción completa

Detalles Bibliográficos
Autores principales: Ming, Chang, Viassolo, Valeria, Probst-Hensch, Nicole, Chappuis, Pierre O., Dinov, Ivo D., Katapodi, Maria C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6585114/
https://www.ncbi.nlm.nih.gov/pubmed/31221197
http://dx.doi.org/10.1186/s13058-019-1158-4
_version_ 1783428643172646912
author Ming, Chang
Viassolo, Valeria
Probst-Hensch, Nicole
Chappuis, Pierre O.
Dinov, Ivo D.
Katapodi, Maria C.
author_facet Ming, Chang
Viassolo, Valeria
Probst-Hensch, Nicole
Chappuis, Pierre O.
Dinov, Ivo D.
Katapodi, Maria C.
author_sort Ming, Chang
collection PubMed
description BACKGROUND: Comprehensive breast cancer risk prediction models enable identifying and targeting women at high-risk, while reducing interventions in those at low-risk. Breast cancer risk prediction models used in clinical practice have low discriminatory accuracy (0.53–0.64). Machine learning (ML) offers an alternative approach to standard prediction modeling that may address current limitations and improve accuracy of those tools. The purpose of this study was to compare the discriminatory accuracy of ML-based estimates against a pair of established methods—the Breast Cancer Risk Assessment Tool (BCRAT) and Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) models. METHODS: We quantified and compared the performance of eight different ML methods to the performance of BCRAT and BOADICEA using eight simulated datasets and two retrospective samples: a random population-based sample of U.S. breast cancer patients and their cancer-free female relatives (N = 1143), and a clinical sample of Swiss breast cancer patients and cancer-free women seeking genetic evaluation and/or testing (N = 2481). RESULTS: Predictive accuracy (AU-ROC curve) reached 88.28% using ML-Adaptive Boosting and 88.89% using ML-random forest versus 62.40% with BCRAT for the U.S. population-based sample. Predictive accuracy reached 90.17% using ML-adaptive boosting and 89.32% using ML-Markov chain Monte Carlo generalized linear mixed model versus 59.31% with BOADICEA for the Swiss clinic-based sample. CONCLUSIONS: There was a striking improvement in the accuracy of classification of women with and without breast cancer achieved with ML algorithms compared to the state-of-the-art model-based approaches. High-accuracy prediction techniques are important in personalized medicine because they facilitate stratification of prevention strategies and individualized clinical management.
format Online
Article
Text
id pubmed-6585114
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65851142019-06-27 Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models Ming, Chang Viassolo, Valeria Probst-Hensch, Nicole Chappuis, Pierre O. Dinov, Ivo D. Katapodi, Maria C. Breast Cancer Res Research Article BACKGROUND: Comprehensive breast cancer risk prediction models enable identifying and targeting women at high-risk, while reducing interventions in those at low-risk. Breast cancer risk prediction models used in clinical practice have low discriminatory accuracy (0.53–0.64). Machine learning (ML) offers an alternative approach to standard prediction modeling that may address current limitations and improve accuracy of those tools. The purpose of this study was to compare the discriminatory accuracy of ML-based estimates against a pair of established methods—the Breast Cancer Risk Assessment Tool (BCRAT) and Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) models. METHODS: We quantified and compared the performance of eight different ML methods to the performance of BCRAT and BOADICEA using eight simulated datasets and two retrospective samples: a random population-based sample of U.S. breast cancer patients and their cancer-free female relatives (N = 1143), and a clinical sample of Swiss breast cancer patients and cancer-free women seeking genetic evaluation and/or testing (N = 2481). RESULTS: Predictive accuracy (AU-ROC curve) reached 88.28% using ML-Adaptive Boosting and 88.89% using ML-random forest versus 62.40% with BCRAT for the U.S. population-based sample. Predictive accuracy reached 90.17% using ML-adaptive boosting and 89.32% using ML-Markov chain Monte Carlo generalized linear mixed model versus 59.31% with BOADICEA for the Swiss clinic-based sample. CONCLUSIONS: There was a striking improvement in the accuracy of classification of women with and without breast cancer achieved with ML algorithms compared to the state-of-the-art model-based approaches. High-accuracy prediction techniques are important in personalized medicine because they facilitate stratification of prevention strategies and individualized clinical management. BioMed Central 2019-06-20 2019 /pmc/articles/PMC6585114/ /pubmed/31221197 http://dx.doi.org/10.1186/s13058-019-1158-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ming, Chang
Viassolo, Valeria
Probst-Hensch, Nicole
Chappuis, Pierre O.
Dinov, Ivo D.
Katapodi, Maria C.
Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
title Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
title_full Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
title_fullStr Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
title_full_unstemmed Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
title_short Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models
title_sort machine learning techniques for personalized breast cancer risk prediction: comparison with the bcrat and boadicea models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6585114/
https://www.ncbi.nlm.nih.gov/pubmed/31221197
http://dx.doi.org/10.1186/s13058-019-1158-4
work_keys_str_mv AT mingchang machinelearningtechniquesforpersonalizedbreastcancerriskpredictioncomparisonwiththebcratandboadiceamodels
AT viassolovaleria machinelearningtechniquesforpersonalizedbreastcancerriskpredictioncomparisonwiththebcratandboadiceamodels
AT probsthenschnicole machinelearningtechniquesforpersonalizedbreastcancerriskpredictioncomparisonwiththebcratandboadiceamodels
AT chappuispierreo machinelearningtechniquesforpersonalizedbreastcancerriskpredictioncomparisonwiththebcratandboadiceamodels
AT dinovivod machinelearningtechniquesforpersonalizedbreastcancerriskpredictioncomparisonwiththebcratandboadiceamodels
AT katapodimariac machinelearningtechniquesforpersonalizedbreastcancerriskpredictioncomparisonwiththebcratandboadiceamodels