Cargando…

An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci

Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly val...

Descripción completa

Detalles Bibliográficos
Autores principales: Ju, Jin Hyun, Shenoy, Sushila A., Crystal, Ronald G., Mezey, Jason G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5448815/
https://www.ncbi.nlm.nih.gov/pubmed/28505156
http://dx.doi.org/10.1371/journal.pcbi.1005537
_version_ 1783239635147685888
author Ju, Jin Hyun
Shenoy, Sushila A.
Crystal, Ronald G.
Mezey, Jason G.
author_facet Ju, Jin Hyun
Shenoy, Sushila A.
Crystal, Ronald G.
Mezey, Jason G.
author_sort Ju, Jin Hyun
collection PubMed
description Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL.
format Online
Article
Text
id pubmed-5448815
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54488152017-06-06 An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci Ju, Jin Hyun Shenoy, Sushila A. Crystal, Ronald G. Mezey, Jason G. PLoS Comput Biol Research Article Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL. Public Library of Science 2017-05-15 /pmc/articles/PMC5448815/ /pubmed/28505156 http://dx.doi.org/10.1371/journal.pcbi.1005537 Text en © 2017 Ju et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ju, Jin Hyun
Shenoy, Sushila A.
Crystal, Ronald G.
Mezey, Jason G.
An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
title An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
title_full An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
title_fullStr An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
title_full_unstemmed An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
title_short An independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
title_sort independent component analysis confounding factor correction framework for identifying broad impact expression quantitative trait loci
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5448815/
https://www.ncbi.nlm.nih.gov/pubmed/28505156
http://dx.doi.org/10.1371/journal.pcbi.1005537
work_keys_str_mv AT jujinhyun anindependentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT shenoysushilaa anindependentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT crystalronaldg anindependentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT mezeyjasong anindependentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT jujinhyun independentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT shenoysushilaa independentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT crystalronaldg independentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci
AT mezeyjasong independentcomponentanalysisconfoundingfactorcorrectionframeworkforidentifyingbroadimpactexpressionquantitativetraitloci