Cargando…
Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
BACKGROUND: Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown li...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681066/ https://www.ncbi.nlm.nih.gov/pubmed/19426453 http://dx.doi.org/10.1186/1471-2105-10-S4-S5 |
_version_ | 1782167009194147840 |
---|---|
author | Lin, Xiaotong Liu, Mei Chen, Xue-wen |
author_facet | Lin, Xiaotong Liu, Mei Chen, Xue-wen |
author_sort | Lin, Xiaotong |
collection | PubMed |
description | BACKGROUND: Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown linkages among proteins in many organisms, the high-throughput interaction data is often associated with high error rates. Since protein interaction networks have been utilized in numerous biological inferences, the inclusive experimental errors inevitably affect the quality of such prediction. Thus, it is essential to assess the quality of the protein interaction data. RESULTS: In this paper, a novel Bayesian network-based integrative framework is proposed to assess the reliability of protein-protein interactions. We develop a cross-species in silico model that assigns likelihood scores to individual protein pairs based on the information entirely extracted from model organisms. Our proposed approach integrates multiple microarray datasets and novel features derived from gene ontology. Furthermore, the confidence scores for cross-species protein mappings are explicitly incorporated into our model. Applying our model to predict protein interactions in the human genome, we are able to achieve 80% in sensitivity and 70% in specificity. Finally, we assess the overall quality of the experimentally determined yeast protein-protein interaction dataset. We observe that the more high-throughput experiments confirming an interaction, the higher the likelihood score, which confirms the effectiveness of our approach. CONCLUSION: This study demonstrates that model organisms certainly provide important information for protein-protein interaction inference and assessment. The proposed method is able to assess not only the overall quality of an interaction dataset, but also the quality of individual protein-protein interactions. We expect the method to continually improve as more high quality interaction data from more model organisms becomes available and is readily scalable to a genome-wide application. |
format | Text |
id | pubmed-2681066 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26810662009-05-13 Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms Lin, Xiaotong Liu, Mei Chen, Xue-wen BMC Bioinformatics Proceedings BACKGROUND: Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown linkages among proteins in many organisms, the high-throughput interaction data is often associated with high error rates. Since protein interaction networks have been utilized in numerous biological inferences, the inclusive experimental errors inevitably affect the quality of such prediction. Thus, it is essential to assess the quality of the protein interaction data. RESULTS: In this paper, a novel Bayesian network-based integrative framework is proposed to assess the reliability of protein-protein interactions. We develop a cross-species in silico model that assigns likelihood scores to individual protein pairs based on the information entirely extracted from model organisms. Our proposed approach integrates multiple microarray datasets and novel features derived from gene ontology. Furthermore, the confidence scores for cross-species protein mappings are explicitly incorporated into our model. Applying our model to predict protein interactions in the human genome, we are able to achieve 80% in sensitivity and 70% in specificity. Finally, we assess the overall quality of the experimentally determined yeast protein-protein interaction dataset. We observe that the more high-throughput experiments confirming an interaction, the higher the likelihood score, which confirms the effectiveness of our approach. CONCLUSION: This study demonstrates that model organisms certainly provide important information for protein-protein interaction inference and assessment. The proposed method is able to assess not only the overall quality of an interaction dataset, but also the quality of individual protein-protein interactions. We expect the method to continually improve as more high quality interaction data from more model organisms becomes available and is readily scalable to a genome-wide application. BioMed Central 2009-04-29 /pmc/articles/PMC2681066/ /pubmed/19426453 http://dx.doi.org/10.1186/1471-2105-10-S4-S5 Text en Copyright © 2009 Lin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Lin, Xiaotong Liu, Mei Chen, Xue-wen Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
title | Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
title_full | Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
title_fullStr | Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
title_full_unstemmed | Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
title_short | Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
title_sort | assessing reliability of protein-protein interactions by integrative analysis of data in model organisms |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681066/ https://www.ncbi.nlm.nih.gov/pubmed/19426453 http://dx.doi.org/10.1186/1471-2105-10-S4-S5 |
work_keys_str_mv | AT linxiaotong assessingreliabilityofproteinproteininteractionsbyintegrativeanalysisofdatainmodelorganisms AT liumei assessingreliabilityofproteinproteininteractionsbyintegrativeanalysisofdatainmodelorganisms AT chenxuewen assessingreliabilityofproteinproteininteractionsbyintegrativeanalysisofdatainmodelorganisms |