Cargando…

Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning

Multi-marker approaches have received a lot of attention recently in genome wide association studies and can enhance power to detect new associations under certain conditions. Gene-, gene-set- and pathway-based association tests are increasingly being viewed as useful supplements to the more widely...

Descripción completa

Detalles Bibliográficos
Autores principales: Padhukasahasram, Badri, Reddy, Chandan K., Levin, Albert M., Burchard, Esteban G., Williams, L. Keoki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4664402/
https://www.ncbi.nlm.nih.gov/pubmed/26619286
http://dx.doi.org/10.1371/journal.pone.0143489
_version_ 1782403417694535680
author Padhukasahasram, Badri
Reddy, Chandan K.
Levin, Albert M.
Burchard, Esteban G.
Williams, L. Keoki
author_facet Padhukasahasram, Badri
Reddy, Chandan K.
Levin, Albert M.
Burchard, Esteban G.
Williams, L. Keoki
author_sort Padhukasahasram, Badri
collection PubMed
description Multi-marker approaches have received a lot of attention recently in genome wide association studies and can enhance power to detect new associations under certain conditions. Gene-, gene-set- and pathway-based association tests are increasingly being viewed as useful supplements to the more widely used single marker association analysis which have successfully uncovered numerous disease variants. A major drawback of single-marker based methods is that they do not look at the joint effects of multiple genetic variants which individually may have weak or moderate signals. Here, we describe novel tests for multi-marker association analyses that are based on phenotype predictions obtained from machine learning algorithms. Instead of assuming a linear or logistic regression model, we propose the use of ensembles of diverse machine learning algorithms for prediction. We show that phenotype predictions obtained from ensemble learning algorithms provide a new framework for multi-marker association analysis. They can be used for constructing tests for the joint association of multiple variants, adjusting for covariates and testing for the presence of interactions. To demonstrate the power and utility of this new approach, we first apply our method to simulated SNP datasets. We show that the proposed method has the correct Type-1 error rates and can be considerably more powerful than alternative approaches in some situations. Then, we apply our method to previously studied asthma-related genes in 2 independent asthma cohorts to conduct association tests.
format Online
Article
Text
id pubmed-4664402
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46644022015-12-10 Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning Padhukasahasram, Badri Reddy, Chandan K. Levin, Albert M. Burchard, Esteban G. Williams, L. Keoki PLoS One Research Article Multi-marker approaches have received a lot of attention recently in genome wide association studies and can enhance power to detect new associations under certain conditions. Gene-, gene-set- and pathway-based association tests are increasingly being viewed as useful supplements to the more widely used single marker association analysis which have successfully uncovered numerous disease variants. A major drawback of single-marker based methods is that they do not look at the joint effects of multiple genetic variants which individually may have weak or moderate signals. Here, we describe novel tests for multi-marker association analyses that are based on phenotype predictions obtained from machine learning algorithms. Instead of assuming a linear or logistic regression model, we propose the use of ensembles of diverse machine learning algorithms for prediction. We show that phenotype predictions obtained from ensemble learning algorithms provide a new framework for multi-marker association analysis. They can be used for constructing tests for the joint association of multiple variants, adjusting for covariates and testing for the presence of interactions. To demonstrate the power and utility of this new approach, we first apply our method to simulated SNP datasets. We show that the proposed method has the correct Type-1 error rates and can be considerably more powerful than alternative approaches in some situations. Then, we apply our method to previously studied asthma-related genes in 2 independent asthma cohorts to conduct association tests. Public Library of Science 2015-11-30 /pmc/articles/PMC4664402/ /pubmed/26619286 http://dx.doi.org/10.1371/journal.pone.0143489 Text en © 2015 Padhukasahasram et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Padhukasahasram, Badri
Reddy, Chandan K.
Levin, Albert M.
Burchard, Esteban G.
Williams, L. Keoki
Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning
title Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning
title_full Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning
title_fullStr Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning
title_full_unstemmed Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning
title_short Powerful Tests for Multi-Marker Association Analysis Using Ensemble Learning
title_sort powerful tests for multi-marker association analysis using ensemble learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4664402/
https://www.ncbi.nlm.nih.gov/pubmed/26619286
http://dx.doi.org/10.1371/journal.pone.0143489
work_keys_str_mv AT padhukasahasrambadri powerfultestsformultimarkerassociationanalysisusingensemblelearning
AT reddychandank powerfultestsformultimarkerassociationanalysisusingensemblelearning
AT levinalbertm powerfultestsformultimarkerassociationanalysisusingensemblelearning
AT burchardestebang powerfultestsformultimarkerassociationanalysisusingensemblelearning
AT williamslkeoki powerfultestsformultimarkerassociationanalysisusingensemblelearning