Cargando…

Disentangling molecular relationships with a causal inference test

BACKGROUND: There has been intense effort over the past couple of decades to identify loci underlying quantitative traits as a key step in the process of elucidating the etiology of complex diseases. Recently there has been some effort to coalesce non-biased high-throughput data, e.g. high density g...

Descripción completa

Detalles Bibliográficos
Autores principales: Millstein, Joshua, Zhang, Bin, Zhu, Jun, Schadt, Eric E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224661/
https://www.ncbi.nlm.nih.gov/pubmed/19473544
http://dx.doi.org/10.1186/1471-2156-10-23
_version_ 1782217423385001984
author Millstein, Joshua
Zhang, Bin
Zhu, Jun
Schadt, Eric E
author_facet Millstein, Joshua
Zhang, Bin
Zhu, Jun
Schadt, Eric E
author_sort Millstein, Joshua
collection PubMed
description BACKGROUND: There has been intense effort over the past couple of decades to identify loci underlying quantitative traits as a key step in the process of elucidating the etiology of complex diseases. Recently there has been some effort to coalesce non-biased high-throughput data, e.g. high density genotyping and genome wide RNA expression, to drive understanding of the molecular basis of disease. However, a stumbling block has been the difficult question of how to leverage this information to identify molecular mechanisms that explain quantitative trait loci (QTL). We have developed a formal statistical hypothesis test, resulting in a p-value, to quantify uncertainty in a causal inference pertaining to a measured factor, e.g. a molecular species, which potentially mediates a known causal association between a locus and a quantitative trait. RESULTS: We treat the causal inference as a 'chain' of mathematical conditions that must be satisfied to conclude that the potential mediator is causal for the trait, where the inference is only as good as the weakest link in the chain. P-values are computed for the component conditions, which include tests of linkage and conditional independence. The Intersection-Union Test, in which a series of statistical tests are combined to form an omnibus test, is then employed to generate the overall test result. Using computer simulated mouse crosses, we show that type I error is low under a variety of conditions that include hidden variables and reactive pathways. We show that power under a simple causal model is comparable to other model selection techniques as well as Bayesian network reconstruction methods. We further show empirically that this method compares favorably to Bayesian network reconstruction methods for reconstructing transcriptional regulatory networks in yeast, recovering 7 out of 8 experimentally validated regulators. CONCLUSION: Here we propose a novel statistical framework in which existing notions of causal mediation are formalized into a hypothesis test, thus providing a standard quantitative measure of uncertainty in the form of a p-value. The method is theoretically and computationally accessible and with the provided software may prove a useful tool in disentangling molecular relationships.
format Online
Article
Text
id pubmed-3224661
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32246612011-11-28 Disentangling molecular relationships with a causal inference test Millstein, Joshua Zhang, Bin Zhu, Jun Schadt, Eric E BMC Genet Methodology Article BACKGROUND: There has been intense effort over the past couple of decades to identify loci underlying quantitative traits as a key step in the process of elucidating the etiology of complex diseases. Recently there has been some effort to coalesce non-biased high-throughput data, e.g. high density genotyping and genome wide RNA expression, to drive understanding of the molecular basis of disease. However, a stumbling block has been the difficult question of how to leverage this information to identify molecular mechanisms that explain quantitative trait loci (QTL). We have developed a formal statistical hypothesis test, resulting in a p-value, to quantify uncertainty in a causal inference pertaining to a measured factor, e.g. a molecular species, which potentially mediates a known causal association between a locus and a quantitative trait. RESULTS: We treat the causal inference as a 'chain' of mathematical conditions that must be satisfied to conclude that the potential mediator is causal for the trait, where the inference is only as good as the weakest link in the chain. P-values are computed for the component conditions, which include tests of linkage and conditional independence. The Intersection-Union Test, in which a series of statistical tests are combined to form an omnibus test, is then employed to generate the overall test result. Using computer simulated mouse crosses, we show that type I error is low under a variety of conditions that include hidden variables and reactive pathways. We show that power under a simple causal model is comparable to other model selection techniques as well as Bayesian network reconstruction methods. We further show empirically that this method compares favorably to Bayesian network reconstruction methods for reconstructing transcriptional regulatory networks in yeast, recovering 7 out of 8 experimentally validated regulators. CONCLUSION: Here we propose a novel statistical framework in which existing notions of causal mediation are formalized into a hypothesis test, thus providing a standard quantitative measure of uncertainty in the form of a p-value. The method is theoretically and computationally accessible and with the provided software may prove a useful tool in disentangling molecular relationships. BioMed Central 2009-05-27 /pmc/articles/PMC3224661/ /pubmed/19473544 http://dx.doi.org/10.1186/1471-2156-10-23 Text en Copyright ©2009 Millstein et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Millstein, Joshua
Zhang, Bin
Zhu, Jun
Schadt, Eric E
Disentangling molecular relationships with a causal inference test
title Disentangling molecular relationships with a causal inference test
title_full Disentangling molecular relationships with a causal inference test
title_fullStr Disentangling molecular relationships with a causal inference test
title_full_unstemmed Disentangling molecular relationships with a causal inference test
title_short Disentangling molecular relationships with a causal inference test
title_sort disentangling molecular relationships with a causal inference test
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224661/
https://www.ncbi.nlm.nih.gov/pubmed/19473544
http://dx.doi.org/10.1186/1471-2156-10-23
work_keys_str_mv AT millsteinjoshua disentanglingmolecularrelationshipswithacausalinferencetest
AT zhangbin disentanglingmolecularrelationshipswithacausalinferencetest
AT zhujun disentanglingmolecularrelationshipswithacausalinferencetest
AT schadterice disentanglingmolecularrelationshipswithacausalinferencetest