Cargando…

An integrated approach to the prediction of domain-domain interactions

BACKGROUND: The development of high-throughput technologies has produced several large scale protein interaction data sets for multiple species, and significant efforts have been made to analyze the data sets in order to understand protein activities. Considering that the basic units of protein inte...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Hyunju, Deng, Minghua, Sun, Fengzhu, Chen, Ting
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2006
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1481624/ https://www.ncbi.nlm.nih.gov/pubmed/16725050 http://dx.doi.org/10.1186/1471-2105-7-269

_version_	1782128279215407104
author	Lee, Hyunju Deng, Minghua Sun, Fengzhu Chen, Ting
author_facet	Lee, Hyunju Deng, Minghua Sun, Fengzhu Chen, Ting
author_sort	Lee, Hyunju
collection	PubMed
description	BACKGROUND: The development of high-throughput technologies has produced several large scale protein interaction data sets for multiple species, and significant efforts have been made to analyze the data sets in order to understand protein activities. Considering that the basic units of protein interactions are domain interactions, it is crucial to understand protein interactions at the level of the domains. The availability of many diverse biological data sets provides an opportunity to discover the underlying domain interactions within protein interactions through an integration of these biological data sets. RESULTS: We combine protein interaction data sets from multiple species, molecular sequences, and gene ontology to construct a set of high-confidence domain-domain interactions. First, we propose a new measure, the expected number of interactions for each pair of domains, to score domain interactions based on protein interaction data in one species and show that it has similar performance as the E-value defined by Riley et al. [1]. Our new measure is applied to the protein interaction data sets from yeast, worm, fruitfly and humans. Second, information on pairs of domains that coexist in known proteins and on pairs of domains with the same gene ontology function annotations are incorporated to construct a high-confidence set of domain-domain interactions using a Bayesian approach. Finally, we evaluate the set of domain-domain interactions by comparing predicted domain interactions with those defined in iPfam database [2,3] that were derived based on protein structures. The accuracy of predicted domain interactions are also confirmed by comparing with experimentally obtained domain interactions from H. pylori [4]. As a result, a total of 2,391 high-confidence domain interactions are obtained and these domain interactions are used to unravel detailed protein and domain interactions in several protein complexes. CONCLUSION: Our study shows that integration of multiple biological data sets based on the Bayesian approach provides a reliable framework to predict domain interactions. By integrating multiple data sources, the coverage and accuracy of predicted domain interactions can be significantly increased.
format	Text
id	pubmed-1481624
institution	National Center for Biotechnology Information
language	English
publishDate	2006
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-14816242006-06-22 An integrated approach to the prediction of domain-domain interactions Lee, Hyunju Deng, Minghua Sun, Fengzhu Chen, Ting BMC Bioinformatics Research Article BACKGROUND: The development of high-throughput technologies has produced several large scale protein interaction data sets for multiple species, and significant efforts have been made to analyze the data sets in order to understand protein activities. Considering that the basic units of protein interactions are domain interactions, it is crucial to understand protein interactions at the level of the domains. The availability of many diverse biological data sets provides an opportunity to discover the underlying domain interactions within protein interactions through an integration of these biological data sets. RESULTS: We combine protein interaction data sets from multiple species, molecular sequences, and gene ontology to construct a set of high-confidence domain-domain interactions. First, we propose a new measure, the expected number of interactions for each pair of domains, to score domain interactions based on protein interaction data in one species and show that it has similar performance as the E-value defined by Riley et al. [1]. Our new measure is applied to the protein interaction data sets from yeast, worm, fruitfly and humans. Second, information on pairs of domains that coexist in known proteins and on pairs of domains with the same gene ontology function annotations are incorporated to construct a high-confidence set of domain-domain interactions using a Bayesian approach. Finally, we evaluate the set of domain-domain interactions by comparing predicted domain interactions with those defined in iPfam database [2,3] that were derived based on protein structures. The accuracy of predicted domain interactions are also confirmed by comparing with experimentally obtained domain interactions from H. pylori [4]. As a result, a total of 2,391 high-confidence domain interactions are obtained and these domain interactions are used to unravel detailed protein and domain interactions in several protein complexes. CONCLUSION: Our study shows that integration of multiple biological data sets based on the Bayesian approach provides a reliable framework to predict domain interactions. By integrating multiple data sources, the coverage and accuracy of predicted domain interactions can be significantly increased. BioMed Central 2006-05-25 /pmc/articles/PMC1481624/ /pubmed/16725050 http://dx.doi.org/10.1186/1471-2105-7-269 Text en Copyright © 2006 Lee et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Lee, Hyunju Deng, Minghua Sun, Fengzhu Chen, Ting An integrated approach to the prediction of domain-domain interactions
title	An integrated approach to the prediction of domain-domain interactions
title_full	An integrated approach to the prediction of domain-domain interactions
title_fullStr	An integrated approach to the prediction of domain-domain interactions
title_full_unstemmed	An integrated approach to the prediction of domain-domain interactions
title_short	An integrated approach to the prediction of domain-domain interactions
title_sort	integrated approach to the prediction of domain-domain interactions
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1481624/ https://www.ncbi.nlm.nih.gov/pubmed/16725050 http://dx.doi.org/10.1186/1471-2105-7-269
work_keys_str_mv	AT leehyunju anintegratedapproachtothepredictionofdomaindomaininteractions AT dengminghua anintegratedapproachtothepredictionofdomaindomaininteractions AT sunfengzhu anintegratedapproachtothepredictionofdomaindomaininteractions AT chenting anintegratedapproachtothepredictionofdomaindomaininteractions AT leehyunju integratedapproachtothepredictionofdomaindomaininteractions AT dengminghua integratedapproachtothepredictionofdomaindomaininteractions AT sunfengzhu integratedapproachtothepredictionofdomaindomaininteractions AT chenting integratedapproachtothepredictionofdomaindomaininteractions

An integrated approach to the prediction of domain-domain interactions

Ejemplares similares