Cargando…

Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling

The accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeli...

Descripción completa

Detalles Bibliográficos
Autores principales: Kundrotas, Petras J., Vakser, Ilya A.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2848539/
https://www.ncbi.nlm.nih.gov/pubmed/20369011
http://dx.doi.org/10.1371/journal.pcbi.1000727
_version_ 1782179677583966208
author Kundrotas, Petras J.
Vakser, Ilya A.
author_facet Kundrotas, Petras J.
Vakser, Ilya A.
author_sort Kundrotas, Petras J.
collection PubMed
description The accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeling of binding sites, a statistical analysis of target-template sequence alignments was performed for a representative set of protein complexes. For most of the complexes, alignments containing all residues of the interface were found. The full interface alignments were obtained even in the case of poor alignments where a relatively small part of the target sequence (as low as 40%) aligned to the template sequence, with a low overall alignment identity (<30%). Although such poor overall alignments might be considered inadequate for modeling of whole proteins, the alignment of the interfaces was strong enough for docking. In the set of homology models built on these alignments, one third of those ranked 1 by a simple sequence identity criteria had RMSD<5 Å, the accuracy suitable for low-resolution template free docking. Such models corresponded to multi-domain target proteins, whereas for single-domain proteins the best models had 5 Å<RMSD<10 Å, the accuracy suitable for less sensitive structure-alignment methods. Overall, ∼50% of complexes with the interfaces modeled by high-throughput techniques had accuracy suitable for meaningful docking experiments. This percentage will grow with the increasing availability of co-crystallized protein-protein complexes.
format Text
id pubmed-2848539
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28485392010-04-05 Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling Kundrotas, Petras J. Vakser, Ilya A. PLoS Comput Biol Research Article The accuracy of protein structures, particularly their binding sites, is essential for the success of modeling protein complexes. Computationally inexpensive methodology is required for genome-wide modeling of such structures. For systematic evaluation of potential accuracy in high-throughput modeling of binding sites, a statistical analysis of target-template sequence alignments was performed for a representative set of protein complexes. For most of the complexes, alignments containing all residues of the interface were found. The full interface alignments were obtained even in the case of poor alignments where a relatively small part of the target sequence (as low as 40%) aligned to the template sequence, with a low overall alignment identity (<30%). Although such poor overall alignments might be considered inadequate for modeling of whole proteins, the alignment of the interfaces was strong enough for docking. In the set of homology models built on these alignments, one third of those ranked 1 by a simple sequence identity criteria had RMSD<5 Å, the accuracy suitable for low-resolution template free docking. Such models corresponded to multi-domain target proteins, whereas for single-domain proteins the best models had 5 Å<RMSD<10 Å, the accuracy suitable for less sensitive structure-alignment methods. Overall, ∼50% of complexes with the interfaces modeled by high-throughput techniques had accuracy suitable for meaningful docking experiments. This percentage will grow with the increasing availability of co-crystallized protein-protein complexes. Public Library of Science 2010-04-01 /pmc/articles/PMC2848539/ /pubmed/20369011 http://dx.doi.org/10.1371/journal.pcbi.1000727 Text en Kundrotas, Vakser. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kundrotas, Petras J.
Vakser, Ilya A.
Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling
title Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling
title_full Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling
title_fullStr Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling
title_full_unstemmed Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling
title_short Accuracy of Protein-Protein Binding Sites in High-Throughput Template-Based Modeling
title_sort accuracy of protein-protein binding sites in high-throughput template-based modeling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2848539/
https://www.ncbi.nlm.nih.gov/pubmed/20369011
http://dx.doi.org/10.1371/journal.pcbi.1000727
work_keys_str_mv AT kundrotaspetrasj accuracyofproteinproteinbindingsitesinhighthroughputtemplatebasedmodeling
AT vakserilyaa accuracyofproteinproteinbindingsitesinhighthroughputtemplatebasedmodeling