Cargando…

A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies

Gene expression measurements are influenced by a wide range of factors, such as the state of the cell, experimental conditions and variants in the sequence of regulatory regions. To understand the effect of a variable of interest, such as the genotype of a locus, it is important to account for varia...

Descripción completa

Detalles Bibliográficos
Autores principales: Stegle, Oliver, Parts, Leopold, Durbin, Richard, Winn, John
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865505/
https://www.ncbi.nlm.nih.gov/pubmed/20463871
http://dx.doi.org/10.1371/journal.pcbi.1000770
_version_ 1782180845981794304
author Stegle, Oliver
Parts, Leopold
Durbin, Richard
Winn, John
author_facet Stegle, Oliver
Parts, Leopold
Durbin, Richard
Winn, John
author_sort Stegle, Oliver
collection PubMed
description Gene expression measurements are influenced by a wide range of factors, such as the state of the cell, experimental conditions and variants in the sequence of regulatory regions. To understand the effect of a variable of interest, such as the genotype of a locus, it is important to account for variation that is due to confounding causes. Here, we present VBQTL, a probabilistic approach for mapping expression quantitative trait loci (eQTLs) that jointly models contributions from genotype as well as known and hidden confounding factors. VBQTL is implemented within an efficient and flexible inference framework, making it fast and tractable on large-scale problems. We compare the performance of VBQTL with alternative methods for dealing with confounding variability on eQTL mapping datasets from simulations, yeast, mouse, and human. Employing Bayesian complexity control and joint modelling is shown to result in more precise estimates of the contribution of different confounding factors resulting in additional associations to measured transcript levels compared to alternative approaches. We present a threefold larger collection of cis eQTLs than previously found in a whole-genome eQTL scan of an outbred human population. Altogether, 27% of the tested probes show a significant genetic association in cis, and we validate that the additional eQTLs are likely to be real by replicating them in different sets of individuals. Our method is the next step in the analysis of high-dimensional phenotype data, and its application has revealed insights into genetic regulation of gene expression by demonstrating more abundant cis-acting eQTLs in human than previously shown. Our software is freely available online at http://www.sanger.ac.uk/resources/software/peer/.
format Text
id pubmed-2865505
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28655052010-05-12 A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies Stegle, Oliver Parts, Leopold Durbin, Richard Winn, John PLoS Comput Biol Research Article Gene expression measurements are influenced by a wide range of factors, such as the state of the cell, experimental conditions and variants in the sequence of regulatory regions. To understand the effect of a variable of interest, such as the genotype of a locus, it is important to account for variation that is due to confounding causes. Here, we present VBQTL, a probabilistic approach for mapping expression quantitative trait loci (eQTLs) that jointly models contributions from genotype as well as known and hidden confounding factors. VBQTL is implemented within an efficient and flexible inference framework, making it fast and tractable on large-scale problems. We compare the performance of VBQTL with alternative methods for dealing with confounding variability on eQTL mapping datasets from simulations, yeast, mouse, and human. Employing Bayesian complexity control and joint modelling is shown to result in more precise estimates of the contribution of different confounding factors resulting in additional associations to measured transcript levels compared to alternative approaches. We present a threefold larger collection of cis eQTLs than previously found in a whole-genome eQTL scan of an outbred human population. Altogether, 27% of the tested probes show a significant genetic association in cis, and we validate that the additional eQTLs are likely to be real by replicating them in different sets of individuals. Our method is the next step in the analysis of high-dimensional phenotype data, and its application has revealed insights into genetic regulation of gene expression by demonstrating more abundant cis-acting eQTLs in human than previously shown. Our software is freely available online at http://www.sanger.ac.uk/resources/software/peer/. Public Library of Science 2010-05-06 /pmc/articles/PMC2865505/ /pubmed/20463871 http://dx.doi.org/10.1371/journal.pcbi.1000770 Text en Stegle et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Stegle, Oliver
Parts, Leopold
Durbin, Richard
Winn, John
A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies
title A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies
title_full A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies
title_fullStr A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies
title_full_unstemmed A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies
title_short A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies
title_sort bayesian framework to account for complex non-genetic factors in gene expression levels greatly increases power in eqtl studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2865505/
https://www.ncbi.nlm.nih.gov/pubmed/20463871
http://dx.doi.org/10.1371/journal.pcbi.1000770
work_keys_str_mv AT stegleoliver abayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT partsleopold abayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT durbinrichard abayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT winnjohn abayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT stegleoliver bayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT partsleopold bayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT durbinrichard bayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies
AT winnjohn bayesianframeworktoaccountforcomplexnongeneticfactorsingeneexpressionlevelsgreatlyincreasespowerineqtlstudies