Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA

Bibliographic Details
Main Authors: Ahmed, Zia U.; Sun, Kang; Shelly, Michael; Mu, Lina
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677843/
https://www.ncbi.nlm.nih.gov/pubmed/34916529
http://dx.doi.org/10.1038/s41598-021-03198-8
_version_ 1784616228140089344
author Ahmed, Zia U.
Sun, Kang
Shelly, Michael
Mu, Lina
collection PubMed
description Machine learning (ML) has demonstrated promise in predicting mortality; however, understanding spatial variation in risk factor contributions to mortality rates requires explainability. We applied explainable artificial intelligence (XAI) to a stack-ensemble machine learning framework to explore and visualize the spatial distribution of the contributions of known risk factors to lung and bronchus cancer (LBC) mortality rates in the conterminous United States. We used five base learners to develop the stack-ensemble models: generalized linear model (GLM), random forest (RF), gradient boosting machine (GBM), extreme gradient boosting machine (XGBoost), and deep neural network (DNN). We then applied several model-agnostic approaches to interpret and visualize the stack-ensemble model's output at global and local (county-level) scales. The stack ensemble generally performs better than all the base learners and three spatial regression models. A permutation-based feature importance technique ranked smoking prevalence as the most important predictor, followed by poverty and elevation. However, the impact of these risk factors on LBC mortality rates varies spatially. This is the first study to use ensemble machine learning with explainable algorithms to explore and visualize the spatial heterogeneity of the relationships between LBC mortality and risk factors in the contiguous USA.
format Online
Article
Text
id pubmed-8677843
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-8677843 2021-12-20 Sci Rep Article
Nature Publishing Group UK 2021-12-16 /pmc/articles/PMC8677843/ /pubmed/34916529 http://dx.doi.org/10.1038/s41598-021-03198-8 Text en
© The Author(s) 2021. Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA
topic Article
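The description in this record mentions a permutation-based feature importance technique applied in a model-agnostic way. The following is a minimal sketch of that general idea only, not the authors' implementation: it shuffles one feature column at a time and measures how much the prediction error (RMSE) grows. Everything here is an assumption made for illustration: the data are synthetic, `toy_model` and its weights are hypothetical stand-ins for a fitted ensemble, and the feature names are borrowed from the description purely as labels.

```python
import random
from math import sqrt


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))


def permutation_importance(predict, X, y, n_repeats=10, seed=0):
    """Mean increase in RMSE when a single feature column is shuffled.

    `predict` is any callable mapping one row (list of floats) to a
    prediction, so the procedure does not depend on the model type.
    """
    rng = random.Random(seed)
    baseline = rmse(y, [predict(row) for row in X])
    importances = []
    for j in range(len(X[0])):
        increases = []
        for _ in range(n_repeats):
            # Shuffle column j only, leaving every other column intact.
            column = [row[j] for row in X]
            rng.shuffle(column)
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            permuted = rmse(y, [predict(row) for row in X_perm])
            increases.append(permuted - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances


# Synthetic data (assumption): NOT the study's data or results.
FEATURES = ["smoking", "poverty", "elevation"]  # labels only, hypothetical
WEIGHTS = [3.0, 1.5, -0.5]                      # hypothetical toy weights

data_rng = random.Random(42)
X = [[data_rng.random() for _ in FEATURES] for _ in range(500)]


def toy_model(row):
    """Hypothetical stand-in for a fitted model: a fixed linear predictor."""
    return sum(w * x for w, x in zip(WEIGHTS, row))


# Targets are the toy model's output plus small Gaussian noise.
y = [toy_model(row) + data_rng.gauss(0, 0.1) for row in X]

scores = permutation_importance(toy_model, X, y)
ranking = sorted(zip(FEATURES, scores), key=lambda pair: pair[1], reverse=True)

for name, score in ranking:
    print(f"{name}: RMSE increase {score:.3f}")
```

Because the toy weights were chosen by hand, the ranking printed here reflects only those weights and says nothing about the real contribution of any risk factor. A larger RMSE increase means the model relies more heavily on that feature.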