Cargando…

Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers

BACKGROUND: Personalized health-care promises tailored health-care solutions to individual patients based on their genetic background and/or environmental exposure history. To date, disease prediction has been based on a few environmental factors and/or single nucleotide polymorphisms (SNPs), while...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Mousheng, Tantisira, Kelan G, Wu, Ann, Litonjua, Augusto A, Chu, Jen-hwa, Himes, Blanca E, Damask, Amy, Weiss, Scott T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3148549/
https://www.ncbi.nlm.nih.gov/pubmed/21718536
http://dx.doi.org/10.1186/1471-2350-12-90
_version_ 1782209358780694528
author Xu, Mousheng
Tantisira, Kelan G
Wu, Ann
Litonjua, Augusto A
Chu, Jen-hwa
Himes, Blanca E
Damask, Amy
Weiss, Scott T
author_facet Xu, Mousheng
Tantisira, Kelan G
Wu, Ann
Litonjua, Augusto A
Chu, Jen-hwa
Himes, Blanca E
Damask, Amy
Weiss, Scott T
author_sort Xu, Mousheng
collection PubMed
description BACKGROUND: Personalized health-care promises tailored health-care solutions to individual patients based on their genetic background and/or environmental exposure history. To date, disease prediction has been based on a few environmental factors and/or single nucleotide polymorphisms (SNPs), while complex diseases are usually affected by many genetic and environmental factors with each factor contributing a small portion to the outcome. We hypothesized that the use of random forests classifiers to select SNPs would result in an improved predictive model of asthma exacerbations. We tested this hypothesis in a population of childhood asthmatics. METHODS: In this study, using emergency room visits or hospitalizations as the definition of a severe asthma exacerbation, we first identified a list of top Genome Wide Association Study (GWAS) SNPs ranked by Random Forests (RF) importance score for the CAMP (Childhood Asthma Management Program) population of 127 exacerbation cases and 290 non-exacerbation controls. We predict severe asthma exacerbations using the top 10 to 320 SNPs together with age, sex, pre-bronchodilator FEV1 percentage predicted, and treatment group. RESULTS: Testing in an independent set of the CAMP population shows that severe asthma exacerbations can be predicted with an Area Under the Curve (AUC) = 0.66 with 160-320 SNPs in comparison to an AUC score of 0.57 with 10 SNPs. Using the clinical traits alone yielded AUC score of 0.54, suggesting the phenotype is affected by genetic as well as environmental factors. CONCLUSIONS: Our study shows that a random forests algorithm can effectively extract and use the information contained in a small number of samples. Random forests, and other machine learning tools, can be used with GWAS studies to integrate large numbers of predictors simultaneously.
format Online
Article
Text
id pubmed-3148549
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31485492011-08-03 Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers Xu, Mousheng Tantisira, Kelan G Wu, Ann Litonjua, Augusto A Chu, Jen-hwa Himes, Blanca E Damask, Amy Weiss, Scott T BMC Med Genet Research Article BACKGROUND: Personalized health-care promises tailored health-care solutions to individual patients based on their genetic background and/or environmental exposure history. To date, disease prediction has been based on a few environmental factors and/or single nucleotide polymorphisms (SNPs), while complex diseases are usually affected by many genetic and environmental factors with each factor contributing a small portion to the outcome. We hypothesized that the use of random forests classifiers to select SNPs would result in an improved predictive model of asthma exacerbations. We tested this hypothesis in a population of childhood asthmatics. METHODS: In this study, using emergency room visits or hospitalizations as the definition of a severe asthma exacerbation, we first identified a list of top Genome Wide Association Study (GWAS) SNPs ranked by Random Forests (RF) importance score for the CAMP (Childhood Asthma Management Program) population of 127 exacerbation cases and 290 non-exacerbation controls. We predict severe asthma exacerbations using the top 10 to 320 SNPs together with age, sex, pre-bronchodilator FEV1 percentage predicted, and treatment group. RESULTS: Testing in an independent set of the CAMP population shows that severe asthma exacerbations can be predicted with an Area Under the Curve (AUC) = 0.66 with 160-320 SNPs in comparison to an AUC score of 0.57 with 10 SNPs. Using the clinical traits alone yielded AUC score of 0.54, suggesting the phenotype is affected by genetic as well as environmental factors. CONCLUSIONS: Our study shows that a random forests algorithm can effectively extract and use the information contained in a small number of samples. Random forests, and other machine learning tools, can be used with GWAS studies to integrate large numbers of predictors simultaneously. BioMed Central 2011-06-30 /pmc/articles/PMC3148549/ /pubmed/21718536 http://dx.doi.org/10.1186/1471-2350-12-90 Text en Copyright ©2011 Xu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xu, Mousheng
Tantisira, Kelan G
Wu, Ann
Litonjua, Augusto A
Chu, Jen-hwa
Himes, Blanca E
Damask, Amy
Weiss, Scott T
Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers
title Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers
title_full Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers
title_fullStr Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers
title_full_unstemmed Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers
title_short Genome Wide Association Study to predict severe asthma exacerbations in children using random forests classifiers
title_sort genome wide association study to predict severe asthma exacerbations in children using random forests classifiers
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3148549/
https://www.ncbi.nlm.nih.gov/pubmed/21718536
http://dx.doi.org/10.1186/1471-2350-12-90
work_keys_str_mv AT xumousheng genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT tantisirakelang genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT wuann genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT litonjuaaugustoa genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT chujenhwa genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT himesblancae genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT damaskamy genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers
AT weissscottt genomewideassociationstudytopredictsevereasthmaexacerbationsinchildrenusingrandomforestsclassifiers