Cargando…

Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences

BACKGROUND: Protein-protein interactions underlie many important biological processes. Computational prediction methods can nicely complement experimental approaches for identifying protein-protein interactions. Recently, a unique category of sequence-based prediction methods has been put forward -...

Descripción completa

Detalles Bibliográficos
Autor principal: Park, Yungki
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2803199/
https://www.ncbi.nlm.nih.gov/pubmed/20003442
http://dx.doi.org/10.1186/1471-2105-10-419
_version_ 1782176031560433664
author Park, Yungki
author_facet Park, Yungki
author_sort Park, Yungki
collection PubMed
description BACKGROUND: Protein-protein interactions underlie many important biological processes. Computational prediction methods can nicely complement experimental approaches for identifying protein-protein interactions. Recently, a unique category of sequence-based prediction methods has been put forward - unique in the sense that it does not require homologous protein sequences. This enables it to be universally applicable to all protein sequences unlike many of previous sequence-based prediction methods. If effective as claimed, these new sequence-based, universally applicable prediction methods would have far-reaching utilities in many areas of biology research. RESULTS: Upon close survey, I realized that many of these new methods were ill-tested. In addition, newer methods were often published without performance comparison with previous ones. Thus, it is not clear how good they are and whether there are significant performance differences among them. In this study, I have implemented and thoroughly tested 4 different methods on large-scale, non-redundant data sets. It reveals several important points. First, significant performance differences are noted among different methods. Second, data sets typically used for training prediction methods appear significantly biased, limiting the general applicability of prediction methods trained with them. Third, there is still ample room for further developments. In addition, my analysis illustrates the importance of complementary performance measures coupled with right-sized data sets for meaningful benchmark tests. CONCLUSIONS: The current study reveals the potentials and limits of the new category of sequence-based protein-protein interaction prediction methods, which in turn provides a firm ground for future endeavours in this important area of contemporary bioinformatics.
format Text
id pubmed-2803199
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28031992010-01-08 Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences Park, Yungki BMC Bioinformatics Research article BACKGROUND: Protein-protein interactions underlie many important biological processes. Computational prediction methods can nicely complement experimental approaches for identifying protein-protein interactions. Recently, a unique category of sequence-based prediction methods has been put forward - unique in the sense that it does not require homologous protein sequences. This enables it to be universally applicable to all protein sequences unlike many of previous sequence-based prediction methods. If effective as claimed, these new sequence-based, universally applicable prediction methods would have far-reaching utilities in many areas of biology research. RESULTS: Upon close survey, I realized that many of these new methods were ill-tested. In addition, newer methods were often published without performance comparison with previous ones. Thus, it is not clear how good they are and whether there are significant performance differences among them. In this study, I have implemented and thoroughly tested 4 different methods on large-scale, non-redundant data sets. It reveals several important points. First, significant performance differences are noted among different methods. Second, data sets typically used for training prediction methods appear significantly biased, limiting the general applicability of prediction methods trained with them. Third, there is still ample room for further developments. In addition, my analysis illustrates the importance of complementary performance measures coupled with right-sized data sets for meaningful benchmark tests. CONCLUSIONS: The current study reveals the potentials and limits of the new category of sequence-based protein-protein interaction prediction methods, which in turn provides a firm ground for future endeavours in this important area of contemporary bioinformatics. BioMed Central 2009-12-14 /pmc/articles/PMC2803199/ /pubmed/20003442 http://dx.doi.org/10.1186/1471-2105-10-419 Text en Copyright ©2009 Park; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Park, Yungki
Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
title Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
title_full Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
title_fullStr Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
title_full_unstemmed Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
title_short Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
title_sort critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2803199/
https://www.ncbi.nlm.nih.gov/pubmed/20003442
http://dx.doi.org/10.1186/1471-2105-10-419
work_keys_str_mv AT parkyungki criticalassessmentofsequencebasedproteinproteininteractionpredictionmethodsthatdonotrequirehomologousproteinsequences