Cargando…

Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces

BACKGROUND: Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Differentiation be...

Descripción completa

Detalles Bibliográficos
Autores principales: Bordner, Andrew J, Gorin, Andrey A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2413245/
https://www.ncbi.nlm.nih.gov/pubmed/18474114
http://dx.doi.org/10.1186/1471-2105-9-234
_version_ 1782156039148273664
author Bordner, Andrew J
Gorin, Andrey A
author_facet Bordner, Andrew J
Gorin, Andrey A
author_sort Bordner, Andrew J
collection PubMed
description BACKGROUND: Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Differentiation between biological and non-biological contacts and reconstruction of the intact complex is a challenging computational problem. A successful solution can provide additional insights into the fundamental principles of biological recognition and reduce errors in many algorithms and databases utilizing interaction information extracted from the Protein Data Bank (PDB). RESULTS: We have developed a method for identifying protein complexes in the PDB X-ray structures by a four step procedure: (1) comprehensively collecting all protein-protein interfaces; (2) clustering similar protein-protein interfaces together; (3) estimating the probability that each cluster is relevant based on a diverse set of properties; and (4) combining these scores for each PDB entry in order to predict the complex structure. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionary conserved protein-protein interactions. These interfaces, as well as the predicted protein complexes, are available from the Protein Interface Server (PInS) website (see Availability and requirements section). CONCLUSION: Our method demonstrates an almost two-fold reduction of the annotation error rate as evaluated on a large benchmark set of complexes validated from the literature. We also estimate relative contributions of each interface property to the accurate discrimination of biologically relevant interfaces and discuss possible directions for further improving the prediction method.
format Text
id pubmed-2413245
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24132452008-06-06 Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces Bordner, Andrew J Gorin, Andrey A BMC Bioinformatics Research Article BACKGROUND: Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Differentiation between biological and non-biological contacts and reconstruction of the intact complex is a challenging computational problem. A successful solution can provide additional insights into the fundamental principles of biological recognition and reduce errors in many algorithms and databases utilizing interaction information extracted from the Protein Data Bank (PDB). RESULTS: We have developed a method for identifying protein complexes in the PDB X-ray structures by a four step procedure: (1) comprehensively collecting all protein-protein interfaces; (2) clustering similar protein-protein interfaces together; (3) estimating the probability that each cluster is relevant based on a diverse set of properties; and (4) combining these scores for each PDB entry in order to predict the complex structure. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionary conserved protein-protein interactions. These interfaces, as well as the predicted protein complexes, are available from the Protein Interface Server (PInS) website (see Availability and requirements section). CONCLUSION: Our method demonstrates an almost two-fold reduction of the annotation error rate as evaluated on a large benchmark set of complexes validated from the literature. We also estimate relative contributions of each interface property to the accurate discrimination of biologically relevant interfaces and discuss possible directions for further improving the prediction method. BioMed Central 2008-05-12 /pmc/articles/PMC2413245/ /pubmed/18474114 http://dx.doi.org/10.1186/1471-2105-9-234 Text en Copyright © 2008 Bordner and Gorin; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bordner, Andrew J
Gorin, Andrey A
Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
title Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
title_full Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
title_fullStr Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
title_full_unstemmed Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
title_short Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces
title_sort comprehensive inventory of protein complexes in the protein data bank from consistent classification of interfaces
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2413245/
https://www.ncbi.nlm.nih.gov/pubmed/18474114
http://dx.doi.org/10.1186/1471-2105-9-234
work_keys_str_mv AT bordnerandrewj comprehensiveinventoryofproteincomplexesintheproteindatabankfromconsistentclassificationofinterfaces
AT gorinandreya comprehensiveinventoryofproteincomplexesintheproteindatabankfromconsistentclassificationofinterfaces