Cargando…

Discover protein sequence signatures from protein-protein interaction data

BACKGROUND: The development of high-throughput technologies such as yeast two-hybrid systems and mass spectrometry technologies has made it possible to generate large protein-protein interaction (PPI) datasets. Mining these datasets for underlying biological knowledge has, however, remained a challe...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Jianwen, Haasl, Ryan J, Dong, Yinghua, Lushington, Gerald H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1310605/
https://www.ncbi.nlm.nih.gov/pubmed/16305745
http://dx.doi.org/10.1186/1471-2105-6-277
_version_ 1782126310669156352
author Fang, Jianwen
Haasl, Ryan J
Dong, Yinghua
Lushington, Gerald H
author_facet Fang, Jianwen
Haasl, Ryan J
Dong, Yinghua
Lushington, Gerald H
author_sort Fang, Jianwen
collection PubMed
description BACKGROUND: The development of high-throughput technologies such as yeast two-hybrid systems and mass spectrometry technologies has made it possible to generate large protein-protein interaction (PPI) datasets. Mining these datasets for underlying biological knowledge has, however, remained a challenge. RESULTS: A total of 3108 sequence signatures were found, each of which was shared by a set of guest proteins interacting with one of 944 host proteins in Saccharomyces cerevisiae genome. Approximately 94% of these sequence signatures matched entries in InterPro member databases. We identified 84 distinct sequence signatures from the remaining 172 unknown signatures. The signature sharing information was then applied in predicting sub-cellular localization of yeast proteins and the novel signatures were used in identifying possible interacting sites. CONCLUSION: We reported a method of PPI data mining that facilitated the discovery of novel sequence signatures using a large PPI dataset from S. cerevisiae genome as input. The fact that 94% of discovered signatures were known validated the ability of the approach to identify large numbers of signatures from PPI data. The significance of these discovered signatures was demonstrated by their application in predicting sub-cellular localizations and identifying potential interaction binding sites of yeast proteins.
format Text
id pubmed-1310605
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-13106052005-12-10 Discover protein sequence signatures from protein-protein interaction data Fang, Jianwen Haasl, Ryan J Dong, Yinghua Lushington, Gerald H BMC Bioinformatics Research Article BACKGROUND: The development of high-throughput technologies such as yeast two-hybrid systems and mass spectrometry technologies has made it possible to generate large protein-protein interaction (PPI) datasets. Mining these datasets for underlying biological knowledge has, however, remained a challenge. RESULTS: A total of 3108 sequence signatures were found, each of which was shared by a set of guest proteins interacting with one of 944 host proteins in Saccharomyces cerevisiae genome. Approximately 94% of these sequence signatures matched entries in InterPro member databases. We identified 84 distinct sequence signatures from the remaining 172 unknown signatures. The signature sharing information was then applied in predicting sub-cellular localization of yeast proteins and the novel signatures were used in identifying possible interacting sites. CONCLUSION: We reported a method of PPI data mining that facilitated the discovery of novel sequence signatures using a large PPI dataset from S. cerevisiae genome as input. The fact that 94% of discovered signatures were known validated the ability of the approach to identify large numbers of signatures from PPI data. The significance of these discovered signatures was demonstrated by their application in predicting sub-cellular localizations and identifying potential interaction binding sites of yeast proteins. BioMed Central 2005-11-23 /pmc/articles/PMC1310605/ /pubmed/16305745 http://dx.doi.org/10.1186/1471-2105-6-277 Text en Copyright © 2005 Fang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Fang, Jianwen
Haasl, Ryan J
Dong, Yinghua
Lushington, Gerald H
Discover protein sequence signatures from protein-protein interaction data
title Discover protein sequence signatures from protein-protein interaction data
title_full Discover protein sequence signatures from protein-protein interaction data
title_fullStr Discover protein sequence signatures from protein-protein interaction data
title_full_unstemmed Discover protein sequence signatures from protein-protein interaction data
title_short Discover protein sequence signatures from protein-protein interaction data
title_sort discover protein sequence signatures from protein-protein interaction data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1310605/
https://www.ncbi.nlm.nih.gov/pubmed/16305745
http://dx.doi.org/10.1186/1471-2105-6-277
work_keys_str_mv AT fangjianwen discoverproteinsequencesignaturesfromproteinproteininteractiondata
AT haaslryanj discoverproteinsequencesignaturesfromproteinproteininteractiondata
AT dongyinghua discoverproteinsequencesignaturesfromproteinproteininteractiondata
AT lushingtongeraldh discoverproteinsequencesignaturesfromproteinproteininteractiondata