Cargando…

Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival

Cox Proportional Hazards (CPH) analysis is the standard for survival analysis in oncology. Recently, several machine learning (ML) techniques have been adapted for this task. Although they have shown to yield results at least as good as classical methods, they are often disregarded because of their...

Descripción completa

Detalles Bibliográficos
Autores principales:	Moncada-Torres, Arturo, van Maaren, Marissa C., Hendriks, Mathijs P., Siesling, Sabine, Geleijnse, Gijs
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7998037/ https://www.ncbi.nlm.nih.gov/pubmed/33772109 http://dx.doi.org/10.1038/s41598-021-86327-7

_version_	1783670460288860160
author	Moncada-Torres, Arturo van Maaren, Marissa C. Hendriks, Mathijs P. Siesling, Sabine Geleijnse, Gijs
author_facet	Moncada-Torres, Arturo van Maaren, Marissa C. Hendriks, Mathijs P. Siesling, Sabine Geleijnse, Gijs
author_sort	Moncada-Torres, Arturo
collection	PubMed
description	Cox Proportional Hazards (CPH) analysis is the standard for survival analysis in oncology. Recently, several machine learning (ML) techniques have been adapted for this task. Although they have shown to yield results at least as good as classical methods, they are often disregarded because of their lack of transparency and little to no explainability, which are key for their adoption in clinical settings. In this paper, we used data from the Netherlands Cancer Registry of 36,658 non-metastatic breast cancer patients to compare the performance of CPH with ML techniques (Random Survival Forests, Survival Support Vector Machines, and Extreme Gradient Boosting [XGB]) in predicting survival using the [Formula: see text] -index. We demonstrated that in our dataset, ML-based models can perform at least as good as the classical CPH regression ([Formula: see text] -index [Formula: see text] ), and in the case of XGB even better ([Formula: see text] -index [Formula: see text] ). Furthermore, we used Shapley Additive Explanation (SHAP) values to explain the models’ predictions. We concluded that the difference in performance can be attributed to XGB’s ability to model nonlinearities and complex interactions. We also investigated the impact of specific features on the models’ predictions as well as their corresponding insights. Lastly, we showed that explainable ML can generate explicit knowledge of how models make their predictions, which is crucial in increasing the trust and adoption of innovative ML techniques in oncology and healthcare overall.
format	Online Article Text
id	pubmed-7998037
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-79980372021-03-30 Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival Moncada-Torres, Arturo van Maaren, Marissa C. Hendriks, Mathijs P. Siesling, Sabine Geleijnse, Gijs Sci Rep Article Cox Proportional Hazards (CPH) analysis is the standard for survival analysis in oncology. Recently, several machine learning (ML) techniques have been adapted for this task. Although they have shown to yield results at least as good as classical methods, they are often disregarded because of their lack of transparency and little to no explainability, which are key for their adoption in clinical settings. In this paper, we used data from the Netherlands Cancer Registry of 36,658 non-metastatic breast cancer patients to compare the performance of CPH with ML techniques (Random Survival Forests, Survival Support Vector Machines, and Extreme Gradient Boosting [XGB]) in predicting survival using the [Formula: see text] -index. We demonstrated that in our dataset, ML-based models can perform at least as good as the classical CPH regression ([Formula: see text] -index [Formula: see text] ), and in the case of XGB even better ([Formula: see text] -index [Formula: see text] ). Furthermore, we used Shapley Additive Explanation (SHAP) values to explain the models’ predictions. We concluded that the difference in performance can be attributed to XGB’s ability to model nonlinearities and complex interactions. We also investigated the impact of specific features on the models’ predictions as well as their corresponding insights. Lastly, we showed that explainable ML can generate explicit knowledge of how models make their predictions, which is crucial in increasing the trust and adoption of innovative ML techniques in oncology and healthcare overall. Nature Publishing Group UK 2021-03-26 /pmc/articles/PMC7998037/ /pubmed/33772109 http://dx.doi.org/10.1038/s41598-021-86327-7 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Moncada-Torres, Arturo van Maaren, Marissa C. Hendriks, Mathijs P. Siesling, Sabine Geleijnse, Gijs Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival
title	Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival
title_full	Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival
title_fullStr	Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival
title_full_unstemmed	Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival
title_short	Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival
title_sort	explainable machine learning can outperform cox regression predictions and provide insights in breast cancer survival
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7998037/ https://www.ncbi.nlm.nih.gov/pubmed/33772109 http://dx.doi.org/10.1038/s41598-021-86327-7
work_keys_str_mv	AT moncadatorresarturo explainablemachinelearningcanoutperformcoxregressionpredictionsandprovideinsightsinbreastcancersurvival AT vanmaarenmarissac explainablemachinelearningcanoutperformcoxregressionpredictionsandprovideinsightsinbreastcancersurvival AT hendriksmathijsp explainablemachinelearningcanoutperformcoxregressionpredictionsandprovideinsightsinbreastcancersurvival AT sieslingsabine explainablemachinelearningcanoutperformcoxregressionpredictionsandprovideinsightsinbreastcancersurvival AT geleijnsegijs explainablemachinelearningcanoutperformcoxregressionpredictionsandprovideinsightsinbreastcancersurvival

Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival

Ejemplares similares