Cargando…
Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach
BACKGROUND: Chronic respiratory symptoms involving bronchitis, cough and phlegm in children are underappreciated but pose a significant public health burden. Efforts for prevention and management could be supported by an understanding of the relative importance of determinants, including environment...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6441159/ https://www.ncbi.nlm.nih.gov/pubmed/30925901 http://dx.doi.org/10.1186/s12874-019-0708-x |
_version_ | 1783407505051746304 |
---|---|
author | Deng, Huiyu Urman, Robert Gilliland, Frank D. Eckel, Sandrah P. |
author_facet | Deng, Huiyu Urman, Robert Gilliland, Frank D. Eckel, Sandrah P. |
author_sort | Deng, Huiyu |
collection | PubMed |
description | BACKGROUND: Chronic respiratory symptoms involving bronchitis, cough and phlegm in children are underappreciated but pose a significant public health burden. Efforts for prevention and management could be supported by an understanding of the relative importance of determinants, including environmental exposures. Thus, we aim to develop a prediction model for bronchitic symptoms. METHODS: Schoolchildren from the population-based southern California Children’s Health Study were visited annually from 2003 to 2012. Bronchitic symptoms over the prior 12 months were assessed by questionnaire. A gradient boosting model was fit using groups of risk factors (including traffic/air pollution exposures) for all children and by asthma status. Training data consisted of one observation per participant in a random study year (for 50% of participants). Validation data consisted of: (1) a random (later) year in the same participants (within-participant); (2) a random year in participants excluded from the training data (across-participant). RESULTS: At baseline, 13.2% of children had asthma and 18.1% reported bronchitic symptoms. Models performed similarly within- and across-participant. Previous year symptoms/medication use provided much of the predictive ability (across-participant area under the receiver operating characteristic curve (AUC): 0.76 vs 0.78 for all risk factors, in all participants). Traffic/air pollution exposures added modestly to prediction as did body mass index percentile, age and parent stress. CONCLUSIONS: Regardless of asthma status, previous symptoms were the most important predictors of current symptoms. Traffic/air pollution variables contribute modest predictive information, but impact large populations. Methods proposed here could be generalized to personalized exacerbation predictions in future longitudinal studies to support targeted prevention efforts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12874-019-0708-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6441159 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64411592019-04-11 Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach Deng, Huiyu Urman, Robert Gilliland, Frank D. Eckel, Sandrah P. BMC Med Res Methodol Research Article BACKGROUND: Chronic respiratory symptoms involving bronchitis, cough and phlegm in children are underappreciated but pose a significant public health burden. Efforts for prevention and management could be supported by an understanding of the relative importance of determinants, including environmental exposures. Thus, we aim to develop a prediction model for bronchitic symptoms. METHODS: Schoolchildren from the population-based southern California Children’s Health Study were visited annually from 2003 to 2012. Bronchitic symptoms over the prior 12 months were assessed by questionnaire. A gradient boosting model was fit using groups of risk factors (including traffic/air pollution exposures) for all children and by asthma status. Training data consisted of one observation per participant in a random study year (for 50% of participants). Validation data consisted of: (1) a random (later) year in the same participants (within-participant); (2) a random year in participants excluded from the training data (across-participant). RESULTS: At baseline, 13.2% of children had asthma and 18.1% reported bronchitic symptoms. Models performed similarly within- and across-participant. Previous year symptoms/medication use provided much of the predictive ability (across-participant area under the receiver operating characteristic curve (AUC): 0.76 vs 0.78 for all risk factors, in all participants). Traffic/air pollution exposures added modestly to prediction as did body mass index percentile, age and parent stress. CONCLUSIONS: Regardless of asthma status, previous symptoms were the most important predictors of current symptoms. Traffic/air pollution variables contribute modest predictive information, but impact large populations. Methods proposed here could be generalized to personalized exacerbation predictions in future longitudinal studies to support targeted prevention efforts. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12874-019-0708-x) contains supplementary material, which is available to authorized users. BioMed Central 2019-03-29 /pmc/articles/PMC6441159/ /pubmed/30925901 http://dx.doi.org/10.1186/s12874-019-0708-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Deng, Huiyu Urman, Robert Gilliland, Frank D. Eckel, Sandrah P. Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
title | Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
title_full | Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
title_fullStr | Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
title_full_unstemmed | Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
title_short | Understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
title_sort | understanding the importance of key risk factors in predicting chronic bronchitic symptoms using a machine learning approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6441159/ https://www.ncbi.nlm.nih.gov/pubmed/30925901 http://dx.doi.org/10.1186/s12874-019-0708-x |
work_keys_str_mv | AT denghuiyu understandingtheimportanceofkeyriskfactorsinpredictingchronicbronchiticsymptomsusingamachinelearningapproach AT urmanrobert understandingtheimportanceofkeyriskfactorsinpredictingchronicbronchiticsymptomsusingamachinelearningapproach AT gillilandfrankd understandingtheimportanceofkeyriskfactorsinpredictingchronicbronchiticsymptomsusingamachinelearningapproach AT eckelsandrahp understandingtheimportanceofkeyriskfactorsinpredictingchronicbronchiticsymptomsusingamachinelearningapproach |