Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks
BACKGROUND: Reverse engineering of a gene regulatory network with a large number of genes and a limited number of experimental data points is a computationally challenging task. In particular, reverse engineering using linear systems is an underdetermined and ill-conditioned problem, i.e. the amount of...
Main author: | Kim, Chang Sik |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2007 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1959566/ https://www.ncbi.nlm.nih.gov/pubmed/17626641 http://dx.doi.org/10.1186/1471-2105-8-251 |
_version_ | 1782134633601695744 |
---|---|
author | Kim, Chang Sik |
author_facet | Kim, Chang Sik |
author_sort | Kim, Chang Sik |
collection | PubMed |
description | BACKGROUND: Reverse engineering of a gene regulatory network with a large number of genes and a limited number of experimental data points is a computationally challenging task. In particular, reverse engineering using linear systems is an underdetermined and ill-conditioned problem, i.e. the amount of microarray data is limited and the solution is very sensitive to noise in the data. Therefore, reverse engineering of gene regulatory networks with a large number of genes and a limited number of data points requires a rigorous optimization algorithm. RESULTS: This study presents a novel algorithm for reverse engineering with linear systems. The proposed algorithm combines orthogonal least squares, second-order derivatives for network pruning, and Bayesian model comparison. In this study, the entire network is decomposed into a set of small networks that are defined as unit networks. The algorithm provides each unit network with P(D|H(i)), which is used as a confidence level. A unit network with a higher P(D|H(i)) has a higher confidence of being correctly elucidated. Thus, the proposed algorithm is able to locate true positive interactions using P(D|H(i)), which is a unique property of the proposed algorithm. The algorithm is evaluated with synthetic and Saccharomyces cerevisiae expression data using the dynamic Bayesian network. With synthetic data, it is shown that the performance of the algorithm depends on the number of genes, the noise level, and the number of data points. With yeast expression data, it is shown that there is a remarkable number of known physical or genetic events among all interactions elucidated by the proposed algorithm. The performance of the algorithm is compared with the Sparse Bayesian Learning algorithm using both synthetic and Saccharomyces cerevisiae expression data sets. The comparison experiments show that the algorithm produces sparser solutions with fewer false positives than the Sparse Bayesian Learning algorithm. CONCLUSION: From our evaluation experiments, we draw the following conclusions: 1) Simulation results show that the algorithm can be used to elucidate gene regulatory networks using a limited number of experimental data points. 2) Simulation results also show that the algorithm is able to handle noisy data. 3) The experiment with yeast expression data shows that the proposed algorithm reliably elucidates known physical or genetic events. 4) The comparison experiments show that the algorithm performs more efficiently than the Sparse Bayesian Learning algorithm with noisy and limited data. |
format | Text |
id | pubmed-1959566 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1959566 2007-08-31 Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks Kim, Chang Sik BMC Bioinformatics Research Article BioMed Central 2007-07-13 /pmc/articles/PMC1959566/ /pubmed/17626641 http://dx.doi.org/10.1186/1471-2105-8-251 Text en Copyright © 2007 Kim; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kim, Chang Sik Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks |
title | Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks |
title_full | Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks |
title_fullStr | Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks |
title_full_unstemmed | Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks |
title_short | Bayesian Orthogonal Least Squares (BOLS) algorithm for reverse engineering of gene regulatory networks |
title_sort | bayesian orthogonal least squares (bols) algorithm for reverse engineering of gene regulatory networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1959566/ https://www.ncbi.nlm.nih.gov/pubmed/17626641 http://dx.doi.org/10.1186/1471-2105-8-251 |
work_keys_str_mv | AT kimchangsik bayesianorthogonalleastsquaresbolsalgorithmforreverseengineeringofgeneregulatorynetworks |