Cargando…
An integer optimization algorithm for robust identification of non-linear gene regulatory networks
BACKGROUND: Reverse engineering gene networks and identifying regulatory interactions are integral to understanding cellular decision making processes. Advancement in high throughput experimental techniques has initiated innovative data driven analysis of gene regulatory networks. However, inherent...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444924/ https://www.ncbi.nlm.nih.gov/pubmed/22937832 http://dx.doi.org/10.1186/1752-0509-6-119 |
_version_ | 1782243740672327680 |
---|---|
author | Chemmangattuvalappil, Nishanth Task, Keith Banerjee, Ipsita |
author_facet | Chemmangattuvalappil, Nishanth Task, Keith Banerjee, Ipsita |
author_sort | Chemmangattuvalappil, Nishanth |
collection | PubMed |
description | BACKGROUND: Reverse engineering gene networks and identifying regulatory interactions are integral to understanding cellular decision making processes. Advancement in high throughput experimental techniques has initiated innovative data driven analysis of gene regulatory networks. However, inherent noise associated with biological systems requires numerous experimental replicates for reliable conclusions. Furthermore, evidence of robust algorithms directly exploiting basic biological traits are few. Such algorithms are expected to be efficient in their performance and robust in their prediction. RESULTS: We have developed a network identification algorithm to accurately infer both the topology and strength of regulatory interactions from time series gene expression data in the presence of significant experimental noise and non-linear behavior. In this novel formulism, we have addressed data variability in biological systems by integrating network identification with the bootstrap resampling technique, hence predicting robust interactions from limited experimental replicates subjected to noise. Furthermore, we have incorporated non-linearity in gene dynamics using the S-system formulation. The basic network identification formulation exploits the trait of sparsity of biological interactions. Towards that, the identification algorithm is formulated as an integer-programming problem by introducing binary variables for each network component. The objective function is targeted to minimize the network connections subjected to the constraint of maximal agreement between the experimental and predicted gene dynamics. The developed algorithm is validated using both in silico and experimental data-sets. These studies show that the algorithm can accurately predict the topology and connection strength of the in silico networks, as quantified by high precision and recall, and small discrepancy between the actual and predicted kinetic parameters. Furthermore, in both the in silico and experimental case studies, the predicted gene expression profiles are in very close agreement with the dynamics of the input data. CONCLUSIONS: Our integer programming algorithm effectively utilizes bootstrapping to identify robust gene regulatory networks from noisy, non-linear time-series gene expression data. With significant noise and non-linearities being inherent to biological systems, the present formulism, with the incorporation of network sparsity, is extremely relevant to gene regulatory networks, and while the formulation has been validated against in silico and E. Coli data, it can be applied to any biological system. |
format | Online Article Text |
id | pubmed-3444924 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-34449242012-09-21 An integer optimization algorithm for robust identification of non-linear gene regulatory networks Chemmangattuvalappil, Nishanth Task, Keith Banerjee, Ipsita BMC Syst Biol Research Article BACKGROUND: Reverse engineering gene networks and identifying regulatory interactions are integral to understanding cellular decision making processes. Advancement in high throughput experimental techniques has initiated innovative data driven analysis of gene regulatory networks. However, inherent noise associated with biological systems requires numerous experimental replicates for reliable conclusions. Furthermore, evidence of robust algorithms directly exploiting basic biological traits are few. Such algorithms are expected to be efficient in their performance and robust in their prediction. RESULTS: We have developed a network identification algorithm to accurately infer both the topology and strength of regulatory interactions from time series gene expression data in the presence of significant experimental noise and non-linear behavior. In this novel formulism, we have addressed data variability in biological systems by integrating network identification with the bootstrap resampling technique, hence predicting robust interactions from limited experimental replicates subjected to noise. Furthermore, we have incorporated non-linearity in gene dynamics using the S-system formulation. The basic network identification formulation exploits the trait of sparsity of biological interactions. Towards that, the identification algorithm is formulated as an integer-programming problem by introducing binary variables for each network component. The objective function is targeted to minimize the network connections subjected to the constraint of maximal agreement between the experimental and predicted gene dynamics. The developed algorithm is validated using both in silico and experimental data-sets. These studies show that the algorithm can accurately predict the topology and connection strength of the in silico networks, as quantified by high precision and recall, and small discrepancy between the actual and predicted kinetic parameters. Furthermore, in both the in silico and experimental case studies, the predicted gene expression profiles are in very close agreement with the dynamics of the input data. CONCLUSIONS: Our integer programming algorithm effectively utilizes bootstrapping to identify robust gene regulatory networks from noisy, non-linear time-series gene expression data. With significant noise and non-linearities being inherent to biological systems, the present formulism, with the incorporation of network sparsity, is extremely relevant to gene regulatory networks, and while the formulation has been validated against in silico and E. Coli data, it can be applied to any biological system. BioMed Central 2012-09-02 /pmc/articles/PMC3444924/ /pubmed/22937832 http://dx.doi.org/10.1186/1752-0509-6-119 Text en Copyright ©2012 Chemmangattuvalappil et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chemmangattuvalappil, Nishanth Task, Keith Banerjee, Ipsita An integer optimization algorithm for robust identification of non-linear gene regulatory networks |
title | An integer optimization algorithm for robust identification of non-linear gene regulatory networks |
title_full | An integer optimization algorithm for robust identification of non-linear gene regulatory networks |
title_fullStr | An integer optimization algorithm for robust identification of non-linear gene regulatory networks |
title_full_unstemmed | An integer optimization algorithm for robust identification of non-linear gene regulatory networks |
title_short | An integer optimization algorithm for robust identification of non-linear gene regulatory networks |
title_sort | integer optimization algorithm for robust identification of non-linear gene regulatory networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3444924/ https://www.ncbi.nlm.nih.gov/pubmed/22937832 http://dx.doi.org/10.1186/1752-0509-6-119 |
work_keys_str_mv | AT chemmangattuvalappilnishanth anintegeroptimizationalgorithmforrobustidentificationofnonlineargeneregulatorynetworks AT taskkeith anintegeroptimizationalgorithmforrobustidentificationofnonlineargeneregulatorynetworks AT banerjeeipsita anintegeroptimizationalgorithmforrobustidentificationofnonlineargeneregulatorynetworks AT chemmangattuvalappilnishanth integeroptimizationalgorithmforrobustidentificationofnonlineargeneregulatorynetworks AT taskkeith integeroptimizationalgorithmforrobustidentificationofnonlineargeneregulatorynetworks AT banerjeeipsita integeroptimizationalgorithmforrobustidentificationofnonlineargeneregulatorynetworks |