Cargando…
Information theoretic approaches for inference of biological networks from continuous-valued data
BACKGROUND: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed,...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5013667/ https://www.ncbi.nlm.nih.gov/pubmed/27599566 http://dx.doi.org/10.1186/s12918-016-0331-y |
_version_ | 1782452211534528512 |
---|---|
author | Budden, David M. Crampin, Edmund J. |
author_facet | Budden, David M. Crampin, Edmund J. |
author_sort | Budden, David M. |
collection | PubMed |
description | BACKGROUND: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed, which leverage the wealth of transcriptomic data generated by recent consortia to study indirect, gene-level relationships between transcriptional regulators. Despite the popularity of such methods, previous methods of GRN inference exhibit limitations that we highlight and address through the lens of information theory. RESULTS: We introduce new model-free and non-linear information theoretic measures for the inference of GRNs and other biological networks from continuous-valued data. Although previous tools have implemented mutual information as a means of inferring pairwise associations, they either introduce statistical bias through discretisation or are limited to modelling undirected relationships. Our approach overcomes both of these limitations, as demonstrated by a substantial improvement in empirical performance for a set of 160 GRNs of varying size and topology. CONCLUSIONS: The information theoretic measures described in this study yield substantial improvements over previous approaches (e.g. ARACNE) and have been implemented in the latest release of NAIL (Network Analysis and Inference Library). However, despite the theoretical and empirical advantages of these new measures, they do not circumvent the fundamental limitation of indeterminacy exhibited across this class of biological networks. These methods have presently found value in computational neurobiology, and will likely gain traction for GRN analysis as the volume and quality of temporal transcriptomics data continues to improve. |
format | Online Article Text |
id | pubmed-5013667 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50136672016-09-08 Information theoretic approaches for inference of biological networks from continuous-valued data Budden, David M. Crampin, Edmund J. BMC Syst Biol Methodology Article BACKGROUND: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed, which leverage the wealth of transcriptomic data generated by recent consortia to study indirect, gene-level relationships between transcriptional regulators. Despite the popularity of such methods, previous methods of GRN inference exhibit limitations that we highlight and address through the lens of information theory. RESULTS: We introduce new model-free and non-linear information theoretic measures for the inference of GRNs and other biological networks from continuous-valued data. Although previous tools have implemented mutual information as a means of inferring pairwise associations, they either introduce statistical bias through discretisation or are limited to modelling undirected relationships. Our approach overcomes both of these limitations, as demonstrated by a substantial improvement in empirical performance for a set of 160 GRNs of varying size and topology. CONCLUSIONS: The information theoretic measures described in this study yield substantial improvements over previous approaches (e.g. ARACNE) and have been implemented in the latest release of NAIL (Network Analysis and Inference Library). However, despite the theoretical and empirical advantages of these new measures, they do not circumvent the fundamental limitation of indeterminacy exhibited across this class of biological networks. These methods have presently found value in computational neurobiology, and will likely gain traction for GRN analysis as the volume and quality of temporal transcriptomics data continues to improve. BioMed Central 2016-09-06 /pmc/articles/PMC5013667/ /pubmed/27599566 http://dx.doi.org/10.1186/s12918-016-0331-y Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Budden, David M. Crampin, Edmund J. Information theoretic approaches for inference of biological networks from continuous-valued data |
title | Information theoretic approaches for inference of biological networks from continuous-valued data |
title_full | Information theoretic approaches for inference of biological networks from continuous-valued data |
title_fullStr | Information theoretic approaches for inference of biological networks from continuous-valued data |
title_full_unstemmed | Information theoretic approaches for inference of biological networks from continuous-valued data |
title_short | Information theoretic approaches for inference of biological networks from continuous-valued data |
title_sort | information theoretic approaches for inference of biological networks from continuous-valued data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5013667/ https://www.ncbi.nlm.nih.gov/pubmed/27599566 http://dx.doi.org/10.1186/s12918-016-0331-y |
work_keys_str_mv | AT buddendavidm informationtheoreticapproachesforinferenceofbiologicalnetworksfromcontinuousvalueddata AT crampinedmundj informationtheoreticapproachesforinferenceofbiologicalnetworksfromcontinuousvalueddata |