Cargando…

A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis

BACKGROUND: The success achieved by genome-wide association (GWA) studies in the identification of candidate loci for complex diseases has been accompanied by an inability to explain the bulk of heritability. Here, we describe the algorithm V-Bay, a variational Bayes algorithm for multiple locus GWA...

Descripción completa

Detalles Bibliográficos
Autores principales: Logsdon, Benjamin A, Hoffman, Gabriel E, Mezey, Jason G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2824680/
https://www.ncbi.nlm.nih.gov/pubmed/20105321
http://dx.doi.org/10.1186/1471-2105-11-58
_version_ 1782177716565442560
author Logsdon, Benjamin A
Hoffman, Gabriel E
Mezey, Jason G
author_facet Logsdon, Benjamin A
Hoffman, Gabriel E
Mezey, Jason G
author_sort Logsdon, Benjamin A
collection PubMed
description BACKGROUND: The success achieved by genome-wide association (GWA) studies in the identification of candidate loci for complex diseases has been accompanied by an inability to explain the bulk of heritability. Here, we describe the algorithm V-Bay, a variational Bayes algorithm for multiple locus GWA analysis, which is designed to identify weaker associations that may contribute to this missing heritability. RESULTS: V-Bay provides a novel solution to the computational scaling constraints of most multiple locus methods and can complete a simultaneous analysis of a million genetic markers in a few hours, when using a desktop. Using a range of simulated genetic and GWA experimental scenarios, we demonstrate that V-Bay is highly accurate, and reliably identifies associations that are too weak to be discovered by single-marker testing approaches. V-Bay can also outperform a multiple locus analysis method based on the lasso, which has similar scaling properties for large numbers of genetic markers. For demonstration purposes, we also use V-Bay to confirm associations with gene expression in cell lines derived from the Phase II individuals of HapMap. CONCLUSIONS: V-Bay is a versatile, fast, and accurate multiple locus GWA analysis tool for the practitioner interested in identifying weaker associations without high false positive rates.
format Text
id pubmed-2824680
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28246802010-02-19 A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis Logsdon, Benjamin A Hoffman, Gabriel E Mezey, Jason G BMC Bioinformatics Methodology article BACKGROUND: The success achieved by genome-wide association (GWA) studies in the identification of candidate loci for complex diseases has been accompanied by an inability to explain the bulk of heritability. Here, we describe the algorithm V-Bay, a variational Bayes algorithm for multiple locus GWA analysis, which is designed to identify weaker associations that may contribute to this missing heritability. RESULTS: V-Bay provides a novel solution to the computational scaling constraints of most multiple locus methods and can complete a simultaneous analysis of a million genetic markers in a few hours, when using a desktop. Using a range of simulated genetic and GWA experimental scenarios, we demonstrate that V-Bay is highly accurate, and reliably identifies associations that are too weak to be discovered by single-marker testing approaches. V-Bay can also outperform a multiple locus analysis method based on the lasso, which has similar scaling properties for large numbers of genetic markers. For demonstration purposes, we also use V-Bay to confirm associations with gene expression in cell lines derived from the Phase II individuals of HapMap. CONCLUSIONS: V-Bay is a versatile, fast, and accurate multiple locus GWA analysis tool for the practitioner interested in identifying weaker associations without high false positive rates. BioMed Central 2010-01-27 /pmc/articles/PMC2824680/ /pubmed/20105321 http://dx.doi.org/10.1186/1471-2105-11-58 Text en Copyright ©2010 Logsdon et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Logsdon, Benjamin A
Hoffman, Gabriel E
Mezey, Jason G
A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis
title A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis
title_full A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis
title_fullStr A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis
title_full_unstemmed A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis
title_short A variational Bayes algorithm for fast and accurate multiple locus genome-wide association analysis
title_sort variational bayes algorithm for fast and accurate multiple locus genome-wide association analysis
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2824680/
https://www.ncbi.nlm.nih.gov/pubmed/20105321
http://dx.doi.org/10.1186/1471-2105-11-58
work_keys_str_mv AT logsdonbenjamina avariationalbayesalgorithmforfastandaccuratemultiplelocusgenomewideassociationanalysis
AT hoffmangabriele avariationalbayesalgorithmforfastandaccuratemultiplelocusgenomewideassociationanalysis
AT mezeyjasong avariationalbayesalgorithmforfastandaccuratemultiplelocusgenomewideassociationanalysis
AT logsdonbenjamina variationalbayesalgorithmforfastandaccuratemultiplelocusgenomewideassociationanalysis
AT hoffmangabriele variationalbayesalgorithmforfastandaccuratemultiplelocusgenomewideassociationanalysis
AT mezeyjasong variationalbayesalgorithmforfastandaccuratemultiplelocusgenomewideassociationanalysis