Cargando…
Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information
BACKGROUND: Probability based statistical learning methods such as mutual information and Bayesian networks have emerged as a major category of tools for reverse engineering mechanistic relationships from quantitative biological data. In this work we introduce a new statistical learning strategy, MI...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613931/ https://www.ncbi.nlm.nih.gov/pubmed/18980677 http://dx.doi.org/10.1186/1471-2105-9-467 |
_version_ | 1782163214724759552 |
---|---|
author | Luo, Weijun Hankenson, Kurt D Woolf, Peter J |
author_facet | Luo, Weijun Hankenson, Kurt D Woolf, Peter J |
author_sort | Luo, Weijun |
collection | PubMed |
description | BACKGROUND: Probability based statistical learning methods such as mutual information and Bayesian networks have emerged as a major category of tools for reverse engineering mechanistic relationships from quantitative biological data. In this work we introduce a new statistical learning strategy, MI3 that addresses three common issues in previous methods simultaneously: (1) handling of continuous variables, (2) detection of more complex three-way relationships and (3) better differentiation of causal versus confounding relationships. With these improvements, we provide a more realistic representation of the underlying biological system. RESULTS: We test the MI3 algorithm using both synthetic and experimental data. In the synthetic data experiment, MI3 achieved an absolute sensitivity/precision of 0.77/0.83 and a relative sensitivity/precision both of 0.99. In addition, MI3 significantly outperformed the control methods, including Bayesian networks, classical two-way mutual information and a discrete version of MI3. We then used MI3 and control methods to infer a regulatory network centered at the MYC transcription factor from a published microarray dataset. Models selected by MI3 were numerically and biologically distinct from those selected by control methods. Unlike control methods, MI3 effectively differentiated true causal models from confounding models. MI3 recovered major MYC cofactors, and revealed major mechanisms involved in MYC dependent transcriptional regulation, which are strongly supported by literature. The MI3 network showed that limited sets of regulatory mechanisms are employed repeatedly to control the expression of large number of genes. CONCLUSION: Overall, our work demonstrates that MI3 outperforms the frequently used control methods, and provides a powerful method for inferring mechanistic relationships underlying biological and other complex systems. The MI3 method is implemented in R in the "mi3" package, available under the GNU GPL from and from the R package archive CRAN. |
format | Text |
id | pubmed-2613931 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26139312009-01-12 Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information Luo, Weijun Hankenson, Kurt D Woolf, Peter J BMC Bioinformatics Research Article BACKGROUND: Probability based statistical learning methods such as mutual information and Bayesian networks have emerged as a major category of tools for reverse engineering mechanistic relationships from quantitative biological data. In this work we introduce a new statistical learning strategy, MI3 that addresses three common issues in previous methods simultaneously: (1) handling of continuous variables, (2) detection of more complex three-way relationships and (3) better differentiation of causal versus confounding relationships. With these improvements, we provide a more realistic representation of the underlying biological system. RESULTS: We test the MI3 algorithm using both synthetic and experimental data. In the synthetic data experiment, MI3 achieved an absolute sensitivity/precision of 0.77/0.83 and a relative sensitivity/precision both of 0.99. In addition, MI3 significantly outperformed the control methods, including Bayesian networks, classical two-way mutual information and a discrete version of MI3. We then used MI3 and control methods to infer a regulatory network centered at the MYC transcription factor from a published microarray dataset. Models selected by MI3 were numerically and biologically distinct from those selected by control methods. Unlike control methods, MI3 effectively differentiated true causal models from confounding models. MI3 recovered major MYC cofactors, and revealed major mechanisms involved in MYC dependent transcriptional regulation, which are strongly supported by literature. The MI3 network showed that limited sets of regulatory mechanisms are employed repeatedly to control the expression of large number of genes. CONCLUSION: Overall, our work demonstrates that MI3 outperforms the frequently used control methods, and provides a powerful method for inferring mechanistic relationships underlying biological and other complex systems. The MI3 method is implemented in R in the "mi3" package, available under the GNU GPL from and from the R package archive CRAN. BioMed Central 2008-11-03 /pmc/articles/PMC2613931/ /pubmed/18980677 http://dx.doi.org/10.1186/1471-2105-9-467 Text en Copyright © 2008 Luo et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Luo, Weijun Hankenson, Kurt D Woolf, Peter J Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
title | Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
title_full | Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
title_fullStr | Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
title_full_unstemmed | Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
title_short | Learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
title_sort | learning transcriptional regulatory networks from high throughput gene expression data using continuous three-way mutual information |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613931/ https://www.ncbi.nlm.nih.gov/pubmed/18980677 http://dx.doi.org/10.1186/1471-2105-9-467 |
work_keys_str_mv | AT luoweijun learningtranscriptionalregulatorynetworksfromhighthroughputgeneexpressiondatausingcontinuousthreewaymutualinformation AT hankensonkurtd learningtranscriptionalregulatorynetworksfromhighthroughputgeneexpressiondatausingcontinuousthreewaymutualinformation AT woolfpeterj learningtranscriptionalregulatorynetworksfromhighthroughputgeneexpressiondatausingcontinuousthreewaymutualinformation |