Cargando…
Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA
Machine learning (ML) has demonstrated promise in predicting mortality; however, understanding spatial variation in risk factor contributions to mortality rate requires explainability. We applied explainable artificial intelligence (XAI) on a stack-ensemble machine learning model framework to explor...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677843/ https://www.ncbi.nlm.nih.gov/pubmed/34916529 http://dx.doi.org/10.1038/s41598-021-03198-8 |
_version_ | 1784616228140089344 |
---|---|
author | Ahmed, Zia U. Sun, Kang Shelly, Michael Mu, Lina |
author_facet | Ahmed, Zia U. Sun, Kang Shelly, Michael Mu, Lina |
author_sort | Ahmed, Zia U. |
collection | PubMed |
description | Machine learning (ML) has demonstrated promise in predicting mortality; however, understanding spatial variation in risk factor contributions to mortality rate requires explainability. We applied explainable artificial intelligence (XAI) on a stack-ensemble machine learning model framework to explore and visualize the spatial distribution of the contributions of known risk factors to lung and bronchus cancer (LBC) mortality rates in the conterminous United States. We used five base-learners—generalized linear model (GLM), random forest (RF), Gradient boosting machine (GBM), extreme Gradient boosting machine (XGBoost), and Deep Neural Network (DNN) for developing stack-ensemble models. Then we applied several model-agnostic approaches to interpret and visualize the stack ensemble model's output in global and local scales (at the county level). The stack ensemble generally performs better than all the base learners and three spatial regression models. A permutation-based feature importance technique ranked smoking prevalence as the most important predictor, followed by poverty and elevation. However, the impact of these risk factors on LBC mortality rates varies spatially. This is the first study to use ensemble machine learning with explainable algorithms to explore and visualize the spatial heterogeneity of the relationships between LBC mortality and risk factors in the contiguous USA. |
format | Online Article Text |
id | pubmed-8677843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86778432021-12-20 Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA Ahmed, Zia U. Sun, Kang Shelly, Michael Mu, Lina Sci Rep Article Machine learning (ML) has demonstrated promise in predicting mortality; however, understanding spatial variation in risk factor contributions to mortality rate requires explainability. We applied explainable artificial intelligence (XAI) on a stack-ensemble machine learning model framework to explore and visualize the spatial distribution of the contributions of known risk factors to lung and bronchus cancer (LBC) mortality rates in the conterminous United States. We used five base-learners—generalized linear model (GLM), random forest (RF), Gradient boosting machine (GBM), extreme Gradient boosting machine (XGBoost), and Deep Neural Network (DNN) for developing stack-ensemble models. Then we applied several model-agnostic approaches to interpret and visualize the stack ensemble model's output in global and local scales (at the county level). The stack ensemble generally performs better than all the base learners and three spatial regression models. A permutation-based feature importance technique ranked smoking prevalence as the most important predictor, followed by poverty and elevation. However, the impact of these risk factors on LBC mortality rates varies spatially. This is the first study to use ensemble machine learning with explainable algorithms to explore and visualize the spatial heterogeneity of the relationships between LBC mortality and risk factors in the contiguous USA. Nature Publishing Group UK 2021-12-16 /pmc/articles/PMC8677843/ /pubmed/34916529 http://dx.doi.org/10.1038/s41598-021-03198-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Ahmed, Zia U. Sun, Kang Shelly, Michael Mu, Lina Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA |
title | Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA |
title_full | Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA |
title_fullStr | Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA |
title_full_unstemmed | Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA |
title_short | Explainable artificial intelligence (XAI) for exploring spatial variability of lung and bronchus cancer (LBC) mortality rates in the contiguous USA |
title_sort | explainable artificial intelligence (xai) for exploring spatial variability of lung and bronchus cancer (lbc) mortality rates in the contiguous usa |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677843/ https://www.ncbi.nlm.nih.gov/pubmed/34916529 http://dx.doi.org/10.1038/s41598-021-03198-8 |
work_keys_str_mv | AT ahmedziau explainableartificialintelligencexaiforexploringspatialvariabilityoflungandbronchuscancerlbcmortalityratesinthecontiguoususa AT sunkang explainableartificialintelligencexaiforexploringspatialvariabilityoflungandbronchuscancerlbcmortalityratesinthecontiguoususa AT shellymichael explainableartificialintelligencexaiforexploringspatialvariabilityoflungandbronchuscancerlbcmortalityratesinthecontiguoususa AT mulina explainableartificialintelligencexaiforexploringspatialvariabilityoflungandbronchuscancerlbcmortalityratesinthecontiguoususa |