Cargando…
Stability of building gene regulatory networks with sparse autoregressive models
BACKGROUND: Biological networks are constantly subjected to random perturbations, and efficient feedback and compensatory mechanisms exist to maintain their stability. There is an increased interest in building gene regulatory networks (GRNs) from temporal gene expression data because of their numer...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278833/ https://www.ncbi.nlm.nih.gov/pubmed/22373004 http://dx.doi.org/10.1186/1471-2105-12-S13-S17 |
_version_ | 1782223612192751616 |
---|---|
author | Rajapakse, Jagath C Mundra, Piyushkumar A |
author_facet | Rajapakse, Jagath C Mundra, Piyushkumar A |
author_sort | Rajapakse, Jagath C |
collection | PubMed |
description | BACKGROUND: Biological networks are constantly subjected to random perturbations, and efficient feedback and compensatory mechanisms exist to maintain their stability. There is an increased interest in building gene regulatory networks (GRNs) from temporal gene expression data because of their numerous applications in life sciences. However, because of the limited number of time points at which gene expressions can be gathered in practice, computational techniques of building GRN often lead to inaccuracies and instabilities. This paper investigates the stability of sparse auto-regressive models of building GRN from gene expression data. RESULTS: Criteria for evaluating the stability of estimating GRN structure are proposed. Thereby, stability of multivariate vector autoregressive (MVAR) methods - ridge, lasso, and elastic-net - of building GRN were studied by simulating temporal gene expression datasets on scale-free topologies as well as on real data gathered over Hela cell-cycle. Effects of the number of time points on the stability of constructing GRN are investigated. When the number of time points are relatively low compared to the size of network, both accuracy and stability are adversely affected. At least, the number of time points equal to the number of genes in the network are needed to achieve decent accuracy and stability of the networks. Our results on synthetic data indicate that the stability of lasso and elastic-net MVAR methods are comparable, and their accuracies are much higher than the ridge MVAR. As the size of the network grows, the number of time points required to achieve acceptable accuracy and stability are much less relative to the number of genes in the network. The effects of false negatives are easier to improve by increasing the number time points than those due to false positives. Application to HeLa cell-cycle gene expression dataset shows that biologically stable GRN can be obtained by introducing perturbations to the data. CONCLUSIONS: Accuracy and stability of building GRN are crucial for investigation of gene regulations. Sparse MVAR techniques such as lasso and elastic-net provide accurate and stable methods for building even GRN of small size. The effect of false negatives is corrected much easier with the increased number of time points than those due to false positives. With real data, we demonstrate how stable networks can be derived by introducing random perturbation to data. |
format | Online Article Text |
id | pubmed-3278833 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32788332012-02-14 Stability of building gene regulatory networks with sparse autoregressive models Rajapakse, Jagath C Mundra, Piyushkumar A BMC Bioinformatics Proceedings BACKGROUND: Biological networks are constantly subjected to random perturbations, and efficient feedback and compensatory mechanisms exist to maintain their stability. There is an increased interest in building gene regulatory networks (GRNs) from temporal gene expression data because of their numerous applications in life sciences. However, because of the limited number of time points at which gene expressions can be gathered in practice, computational techniques of building GRN often lead to inaccuracies and instabilities. This paper investigates the stability of sparse auto-regressive models of building GRN from gene expression data. RESULTS: Criteria for evaluating the stability of estimating GRN structure are proposed. Thereby, stability of multivariate vector autoregressive (MVAR) methods - ridge, lasso, and elastic-net - of building GRN were studied by simulating temporal gene expression datasets on scale-free topologies as well as on real data gathered over Hela cell-cycle. Effects of the number of time points on the stability of constructing GRN are investigated. When the number of time points are relatively low compared to the size of network, both accuracy and stability are adversely affected. At least, the number of time points equal to the number of genes in the network are needed to achieve decent accuracy and stability of the networks. Our results on synthetic data indicate that the stability of lasso and elastic-net MVAR methods are comparable, and their accuracies are much higher than the ridge MVAR. As the size of the network grows, the number of time points required to achieve acceptable accuracy and stability are much less relative to the number of genes in the network. The effects of false negatives are easier to improve by increasing the number time points than those due to false positives. Application to HeLa cell-cycle gene expression dataset shows that biologically stable GRN can be obtained by introducing perturbations to the data. CONCLUSIONS: Accuracy and stability of building GRN are crucial for investigation of gene regulations. Sparse MVAR techniques such as lasso and elastic-net provide accurate and stable methods for building even GRN of small size. The effect of false negatives is corrected much easier with the increased number of time points than those due to false positives. With real data, we demonstrate how stable networks can be derived by introducing random perturbation to data. BioMed Central 2011-11-30 /pmc/articles/PMC3278833/ /pubmed/22373004 http://dx.doi.org/10.1186/1471-2105-12-S13-S17 Text en Copyright ©2011 Rajapakse and Mundra; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Rajapakse, Jagath C Mundra, Piyushkumar A Stability of building gene regulatory networks with sparse autoregressive models |
title | Stability of building gene regulatory networks with sparse autoregressive models |
title_full | Stability of building gene regulatory networks with sparse autoregressive models |
title_fullStr | Stability of building gene regulatory networks with sparse autoregressive models |
title_full_unstemmed | Stability of building gene regulatory networks with sparse autoregressive models |
title_short | Stability of building gene regulatory networks with sparse autoregressive models |
title_sort | stability of building gene regulatory networks with sparse autoregressive models |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278833/ https://www.ncbi.nlm.nih.gov/pubmed/22373004 http://dx.doi.org/10.1186/1471-2105-12-S13-S17 |
work_keys_str_mv | AT rajapaksejagathc stabilityofbuildinggeneregulatorynetworkswithsparseautoregressivemodels AT mundrapiyushkumara stabilityofbuildinggeneregulatorynetworkswithsparseautoregressivemodels |