Cargando…
Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach
BACKGROUND: Many aspects of biological functions can be modeled by biological networks, such as protein interaction networks, metabolic networks, and gene coexpression networks. Studying the statistical properties of these networks in turn allows us to infer biological function. Complex statistical...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755484/ https://www.ncbi.nlm.nih.gov/pubmed/19728875 http://dx.doi.org/10.1186/1471-2105-10-277 |
_version_ | 1782172453595774976 |
---|---|
author | Wang, Wenhui Nunez-Iglesias, Juan Luan, Yihui Sun, Fengzhu |
author_facet | Wang, Wenhui Nunez-Iglesias, Juan Luan, Yihui Sun, Fengzhu |
author_sort | Wang, Wenhui |
collection | PubMed |
description | BACKGROUND: Many aspects of biological functions can be modeled by biological networks, such as protein interaction networks, metabolic networks, and gene coexpression networks. Studying the statistical properties of these networks in turn allows us to infer biological function. Complex statistical network models can potentially more accurately describe the networks, but it is not clear whether such complex models are better suited to find biologically meaningful subnetworks. RESULTS: Recent studies have shown that the degree distribution of the nodes is not an adequate statistic in many molecular networks. We sought to extend this statistic with 2nd and 3rd order degree correlations and developed a pseudo-likelihood approach to estimate the parameters. The approach was used to analyze the MIPS and BIOGRID yeast protein interaction networks, and two yeast coexpression networks. We showed that 2nd order degree correlation information gave better predictions of gene interactions in both protein interaction and gene coexpression networks. However, in the biologically important task of predicting functionally homogeneous modules, degree correlation information performs marginally better in the case of the MIPS and BIOGRID protein interaction networks, but worse in the case of gene coexpression networks. CONCLUSION: Our use of dK models showed that incorporation of degree correlations could increase predictive power in some contexts, albeit sometimes marginally, but, in all contexts, the use of third-order degree correlations decreased accuracy. However, it is possible that other parameter estimation methods, such as maximum likelihood, will show the usefulness of incorporating 2nd and 3rd degree correlations in predicting functionally homogeneous modules. |
format | Text |
id | pubmed-2755484 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27554842009-10-02 Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach Wang, Wenhui Nunez-Iglesias, Juan Luan, Yihui Sun, Fengzhu BMC Bioinformatics Research Article BACKGROUND: Many aspects of biological functions can be modeled by biological networks, such as protein interaction networks, metabolic networks, and gene coexpression networks. Studying the statistical properties of these networks in turn allows us to infer biological function. Complex statistical network models can potentially more accurately describe the networks, but it is not clear whether such complex models are better suited to find biologically meaningful subnetworks. RESULTS: Recent studies have shown that the degree distribution of the nodes is not an adequate statistic in many molecular networks. We sought to extend this statistic with 2nd and 3rd order degree correlations and developed a pseudo-likelihood approach to estimate the parameters. The approach was used to analyze the MIPS and BIOGRID yeast protein interaction networks, and two yeast coexpression networks. We showed that 2nd order degree correlation information gave better predictions of gene interactions in both protein interaction and gene coexpression networks. However, in the biologically important task of predicting functionally homogeneous modules, degree correlation information performs marginally better in the case of the MIPS and BIOGRID protein interaction networks, but worse in the case of gene coexpression networks. CONCLUSION: Our use of dK models showed that incorporation of degree correlations could increase predictive power in some contexts, albeit sometimes marginally, but, in all contexts, the use of third-order degree correlations decreased accuracy. However, it is possible that other parameter estimation methods, such as maximum likelihood, will show the usefulness of incorporating 2nd and 3rd degree correlations in predicting functionally homogeneous modules. BioMed Central 2009-09-03 /pmc/articles/PMC2755484/ /pubmed/19728875 http://dx.doi.org/10.1186/1471-2105-10-277 Text en Copyright © 2009 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Wang, Wenhui Nunez-Iglesias, Juan Luan, Yihui Sun, Fengzhu Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
title | Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
title_full | Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
title_fullStr | Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
title_full_unstemmed | Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
title_short | Usefulness and limitations of dK random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
title_sort | usefulness and limitations of dk random graph models to predict interactions and functional homogeneity in biological networks under a pseudo-likelihood parameter estimation approach |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755484/ https://www.ncbi.nlm.nih.gov/pubmed/19728875 http://dx.doi.org/10.1186/1471-2105-10-277 |
work_keys_str_mv | AT wangwenhui usefulnessandlimitationsofdkrandomgraphmodelstopredictinteractionsandfunctionalhomogeneityinbiologicalnetworksunderapseudolikelihoodparameterestimationapproach AT nuneziglesiasjuan usefulnessandlimitationsofdkrandomgraphmodelstopredictinteractionsandfunctionalhomogeneityinbiologicalnetworksunderapseudolikelihoodparameterestimationapproach AT luanyihui usefulnessandlimitationsofdkrandomgraphmodelstopredictinteractionsandfunctionalhomogeneityinbiologicalnetworksunderapseudolikelihoodparameterestimationapproach AT sunfengzhu usefulnessandlimitationsofdkrandomgraphmodelstopredictinteractionsandfunctionalhomogeneityinbiologicalnetworksunderapseudolikelihoodparameterestimationapproach |