Cargando…

Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors

In this research, we take a multivariate, multi-method approach to predicting the incidence of lung cancer in the United States. We obtain public health and ambient emission data from multiple sources in 2000–2013 to model lung cancer in the period 2013–2017. We compare several models using four sou...

Descripción completa

Detalles Bibliográficos
Autores principales: Kamis, Arnold, Cao, Rui, He, Yifan, Tian, Yuan, Wu, Chuyue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8201047/
https://www.ncbi.nlm.nih.gov/pubmed/34204140
http://dx.doi.org/10.3390/ijerph18116127
_version_ 1783707725733036032
author Kamis, Arnold
Cao, Rui
He, Yifan
Tian, Yuan
Wu, Chuyue
author_facet Kamis, Arnold
Cao, Rui
He, Yifan
Tian, Yuan
Wu, Chuyue
author_sort Kamis, Arnold
collection PubMed
description In this research, we take a multivariate, multi-method approach to predicting the incidence of lung cancer in the United States. We obtain public health and ambient emission data from multiple sources in 2000–2013 to model lung cancer in the period 2013–2017. We compare several models using four sources of predictor variables: adult smoking, state, environmental quality index, and ambient emissions. The environmental quality index variables pertain to macro-level domains: air, land, water, socio-demographic, and built environment. The ambient emissions consist of Cyanide compounds, Carbon Monoxide, Carbon Disulfide, Diesel Exhaust, Nitrogen Dioxide, Tropospheric Ozone, Coarse Particulate Matter, Fine Particulate Matter, and Sulfur Dioxide. We compare various models and find that the best regression model has variance explained of 62 percent whereas the best machine learning model has 64 percent variance explained with 10% less error. The most hazardous ambient emissions are Coarse Particulate Matter, Fine Particulate Matter, Sulfur Dioxide, Carbon Monoxide, and Tropospheric Ozone. These ambient emissions could be curtailed to improve air quality, thus reducing the incidence of lung cancer. We interpret and discuss the implications of the model results, including the tradeoff between transparency and accuracy. We also review limitations of and directions for the current models in order to extend and refine them.
format Online
Article
Text
id pubmed-8201047
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82010472021-06-15 Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors Kamis, Arnold Cao, Rui He, Yifan Tian, Yuan Wu, Chuyue Int J Environ Res Public Health Article In this research, we take a multivariate, multi-method approach to predicting the incidence of lung cancer in the United States. We obtain public health and ambient emission data from multiple sources in 2000–2013 to model lung cancer in the period 2013–2017. We compare several models using four sources of predictor variables: adult smoking, state, environmental quality index, and ambient emissions. The environmental quality index variables pertain to macro-level domains: air, land, water, socio-demographic, and built environment. The ambient emissions consist of Cyanide compounds, Carbon Monoxide, Carbon Disulfide, Diesel Exhaust, Nitrogen Dioxide, Tropospheric Ozone, Coarse Particulate Matter, Fine Particulate Matter, and Sulfur Dioxide. We compare various models and find that the best regression model has variance explained of 62 percent whereas the best machine learning model has 64 percent variance explained with 10% less error. The most hazardous ambient emissions are Coarse Particulate Matter, Fine Particulate Matter, Sulfur Dioxide, Carbon Monoxide, and Tropospheric Ozone. These ambient emissions could be curtailed to improve air quality, thus reducing the incidence of lung cancer. We interpret and discuss the implications of the model results, including the tradeoff between transparency and accuracy. We also review limitations of and directions for the current models in order to extend and refine them. MDPI 2021-06-06 /pmc/articles/PMC8201047/ /pubmed/34204140 http://dx.doi.org/10.3390/ijerph18116127 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kamis, Arnold
Cao, Rui
He, Yifan
Tian, Yuan
Wu, Chuyue
Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors
title Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors
title_full Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors
title_fullStr Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors
title_full_unstemmed Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors
title_short Predicting Lung Cancer in the United States: A Multiple Model Examination of Public Health Factors
title_sort predicting lung cancer in the united states: a multiple model examination of public health factors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8201047/
https://www.ncbi.nlm.nih.gov/pubmed/34204140
http://dx.doi.org/10.3390/ijerph18116127
work_keys_str_mv AT kamisarnold predictinglungcancerintheunitedstatesamultiplemodelexaminationofpublichealthfactors
AT caorui predictinglungcancerintheunitedstatesamultiplemodelexaminationofpublichealthfactors
AT heyifan predictinglungcancerintheunitedstatesamultiplemodelexaminationofpublichealthfactors
AT tianyuan predictinglungcancerintheunitedstatesamultiplemodelexaminationofpublichealthfactors
AT wuchuyue predictinglungcancerintheunitedstatesamultiplemodelexaminationofpublichealthfactors