Cargando…

Information theoretic approaches for inference of biological networks from continuous-valued data

BACKGROUND: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed,...

Descripción completa

Detalles Bibliográficos
Autores principales: Budden, David M., Crampin, Edmund J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5013667/
https://www.ncbi.nlm.nih.gov/pubmed/27599566
http://dx.doi.org/10.1186/s12918-016-0331-y
_version_ 1782452211534528512
author Budden, David M.
Crampin, Edmund J.
author_facet Budden, David M.
Crampin, Edmund J.
author_sort Budden, David M.
collection PubMed
description BACKGROUND: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed, which leverage the wealth of transcriptomic data generated by recent consortia to study indirect, gene-level relationships between transcriptional regulators. Despite the popularity of such methods, previous methods of GRN inference exhibit limitations that we highlight and address through the lens of information theory. RESULTS: We introduce new model-free and non-linear information theoretic measures for the inference of GRNs and other biological networks from continuous-valued data. Although previous tools have implemented mutual information as a means of inferring pairwise associations, they either introduce statistical bias through discretisation or are limited to modelling undirected relationships. Our approach overcomes both of these limitations, as demonstrated by a substantial improvement in empirical performance for a set of 160 GRNs of varying size and topology. CONCLUSIONS: The information theoretic measures described in this study yield substantial improvements over previous approaches (e.g. ARACNE) and have been implemented in the latest release of NAIL (Network Analysis and Inference Library). However, despite the theoretical and empirical advantages of these new measures, they do not circumvent the fundamental limitation of indeterminacy exhibited across this class of biological networks. These methods have presently found value in computational neurobiology, and will likely gain traction for GRN analysis as the volume and quality of temporal transcriptomics data continues to improve.
format Online
Article
Text
id pubmed-5013667
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50136672016-09-08 Information theoretic approaches for inference of biological networks from continuous-valued data Budden, David M. Crampin, Edmund J. BMC Syst Biol Methodology Article BACKGROUND: Characterising programs of gene regulation by studying individual protein-DNA and protein-protein interactions would require a large volume of high-resolution proteomics data, and such data are not yet available. Instead, many gene regulatory network (GRN) techniques have been developed, which leverage the wealth of transcriptomic data generated by recent consortia to study indirect, gene-level relationships between transcriptional regulators. Despite the popularity of such methods, previous methods of GRN inference exhibit limitations that we highlight and address through the lens of information theory. RESULTS: We introduce new model-free and non-linear information theoretic measures for the inference of GRNs and other biological networks from continuous-valued data. Although previous tools have implemented mutual information as a means of inferring pairwise associations, they either introduce statistical bias through discretisation or are limited to modelling undirected relationships. Our approach overcomes both of these limitations, as demonstrated by a substantial improvement in empirical performance for a set of 160 GRNs of varying size and topology. CONCLUSIONS: The information theoretic measures described in this study yield substantial improvements over previous approaches (e.g. ARACNE) and have been implemented in the latest release of NAIL (Network Analysis and Inference Library). However, despite the theoretical and empirical advantages of these new measures, they do not circumvent the fundamental limitation of indeterminacy exhibited across this class of biological networks. These methods have presently found value in computational neurobiology, and will likely gain traction for GRN analysis as the volume and quality of temporal transcriptomics data continues to improve. BioMed Central 2016-09-06 /pmc/articles/PMC5013667/ /pubmed/27599566 http://dx.doi.org/10.1186/s12918-016-0331-y Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Budden, David M.
Crampin, Edmund J.
Information theoretic approaches for inference of biological networks from continuous-valued data
title Information theoretic approaches for inference of biological networks from continuous-valued data
title_full Information theoretic approaches for inference of biological networks from continuous-valued data
title_fullStr Information theoretic approaches for inference of biological networks from continuous-valued data
title_full_unstemmed Information theoretic approaches for inference of biological networks from continuous-valued data
title_short Information theoretic approaches for inference of biological networks from continuous-valued data
title_sort information theoretic approaches for inference of biological networks from continuous-valued data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5013667/
https://www.ncbi.nlm.nih.gov/pubmed/27599566
http://dx.doi.org/10.1186/s12918-016-0331-y
work_keys_str_mv AT buddendavidm informationtheoreticapproachesforinferenceofbiologicalnetworksfromcontinuousvalueddata
AT crampinedmundj informationtheoreticapproachesforinferenceofbiologicalnetworksfromcontinuousvalueddata