Cargando…

Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms

BACKGROUND: Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown li...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Xiaotong, Liu, Mei, Chen, Xue-wen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681066/
https://www.ncbi.nlm.nih.gov/pubmed/19426453
http://dx.doi.org/10.1186/1471-2105-10-S4-S5
_version_ 1782167009194147840
author Lin, Xiaotong
Liu, Mei
Chen, Xue-wen
author_facet Lin, Xiaotong
Liu, Mei
Chen, Xue-wen
author_sort Lin, Xiaotong
collection PubMed
description BACKGROUND: Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown linkages among proteins in many organisms, the high-throughput interaction data is often associated with high error rates. Since protein interaction networks have been utilized in numerous biological inferences, the inclusive experimental errors inevitably affect the quality of such prediction. Thus, it is essential to assess the quality of the protein interaction data. RESULTS: In this paper, a novel Bayesian network-based integrative framework is proposed to assess the reliability of protein-protein interactions. We develop a cross-species in silico model that assigns likelihood scores to individual protein pairs based on the information entirely extracted from model organisms. Our proposed approach integrates multiple microarray datasets and novel features derived from gene ontology. Furthermore, the confidence scores for cross-species protein mappings are explicitly incorporated into our model. Applying our model to predict protein interactions in the human genome, we are able to achieve 80% in sensitivity and 70% in specificity. Finally, we assess the overall quality of the experimentally determined yeast protein-protein interaction dataset. We observe that the more high-throughput experiments confirming an interaction, the higher the likelihood score, which confirms the effectiveness of our approach. CONCLUSION: This study demonstrates that model organisms certainly provide important information for protein-protein interaction inference and assessment. The proposed method is able to assess not only the overall quality of an interaction dataset, but also the quality of individual protein-protein interactions. We expect the method to continually improve as more high quality interaction data from more model organisms becomes available and is readily scalable to a genome-wide application.
format Text
id pubmed-2681066
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26810662009-05-13 Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms Lin, Xiaotong Liu, Mei Chen, Xue-wen BMC Bioinformatics Proceedings BACKGROUND: Protein-protein interactions play vital roles in nearly all cellular processes and are involved in the construction of biological pathways such as metabolic and signal transduction pathways. Although large-scale experiments have enabled the discovery of thousands of previously unknown linkages among proteins in many organisms, the high-throughput interaction data is often associated with high error rates. Since protein interaction networks have been utilized in numerous biological inferences, the inclusive experimental errors inevitably affect the quality of such prediction. Thus, it is essential to assess the quality of the protein interaction data. RESULTS: In this paper, a novel Bayesian network-based integrative framework is proposed to assess the reliability of protein-protein interactions. We develop a cross-species in silico model that assigns likelihood scores to individual protein pairs based on the information entirely extracted from model organisms. Our proposed approach integrates multiple microarray datasets and novel features derived from gene ontology. Furthermore, the confidence scores for cross-species protein mappings are explicitly incorporated into our model. Applying our model to predict protein interactions in the human genome, we are able to achieve 80% in sensitivity and 70% in specificity. Finally, we assess the overall quality of the experimentally determined yeast protein-protein interaction dataset. We observe that the more high-throughput experiments confirming an interaction, the higher the likelihood score, which confirms the effectiveness of our approach. CONCLUSION: This study demonstrates that model organisms certainly provide important information for protein-protein interaction inference and assessment. The proposed method is able to assess not only the overall quality of an interaction dataset, but also the quality of individual protein-protein interactions. We expect the method to continually improve as more high quality interaction data from more model organisms becomes available and is readily scalable to a genome-wide application. BioMed Central 2009-04-29 /pmc/articles/PMC2681066/ /pubmed/19426453 http://dx.doi.org/10.1186/1471-2105-10-S4-S5 Text en Copyright © 2009 Lin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Lin, Xiaotong
Liu, Mei
Chen, Xue-wen
Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
title Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
title_full Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
title_fullStr Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
title_full_unstemmed Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
title_short Assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
title_sort assessing reliability of protein-protein interactions by integrative analysis of data in model organisms
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2681066/
https://www.ncbi.nlm.nih.gov/pubmed/19426453
http://dx.doi.org/10.1186/1471-2105-10-S4-S5
work_keys_str_mv AT linxiaotong assessingreliabilityofproteinproteininteractionsbyintegrativeanalysisofdatainmodelorganisms
AT liumei assessingreliabilityofproteinproteininteractionsbyintegrativeanalysisofdatainmodelorganisms
AT chenxuewen assessingreliabilityofproteinproteininteractionsbyintegrativeanalysisofdatainmodelorganisms