Cargando…

A supersecondary structure library and search algorithm for modeling loops in protein structures

We present a fragment-search based method for predicting loop conformations in protein models. A hierarchical and multidimensional database has been set up that currently classifies 105 950 loop fragments and loop flanking secondary structures. Besides the length of the loops and types of bracing se...

Descripción completa

Detalles Bibliográficos
Autores principales: Fernandez-Fuentes, Narcis, Oliva, Baldomero, Fiser, András
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1440879/
https://www.ncbi.nlm.nih.gov/pubmed/16617149
http://dx.doi.org/10.1093/nar/gkl156
_version_ 1782127340712624128
author Fernandez-Fuentes, Narcis
Oliva, Baldomero
Fiser, András
author_facet Fernandez-Fuentes, Narcis
Oliva, Baldomero
Fiser, András
author_sort Fernandez-Fuentes, Narcis
collection PubMed
description We present a fragment-search based method for predicting loop conformations in protein models. A hierarchical and multidimensional database has been set up that currently classifies 105 950 loop fragments and loop flanking secondary structures. Besides the length of the loops and types of bracing secondary structures the database is organized along four internal coordinates, a distance and three types of angles characterizing the geometry of stem regions. Candidate fragments are selected from this library by matching the length, the types of bracing secondary structures of the query and satisfying the geometrical restraints of the stems and subsequently inserted in the query protein framework where their fit is assessed by the root mean square deviation (r.m.s.d.) of stem regions and by the number of rigid body clashes with the environment. In the final step remaining candidate loops are ranked by a Z-score that combines information on sequence similarity and fit of predicted and observed ϕ/ψ main chain dihedral angle propensities. Confidence Z-score cut-offs were determined for each loop length that identify those predicted fragments that outperform a competitive ab initio method. A web server implements the method, regularly updates the fragment library and performs prediction. Predicted segments are returned, or optionally, these can be completed with side chain reconstruction and subsequently annealed in the environment of the query protein by conjugate gradient minimization. The prediction method was tested on artificially prepared search datasets where all trivial sequence similarities on the SCOP superfamily level were removed. Under these conditions it is possible to predict loops of length 4, 8 and 12 with coverage of 98, 78 and 28% with at least of 0.22, 1.38 and 2.47 Å of r.m.s.d. accuracy, respectively. In a head-to-head comparison on loops extracted from freshly deposited new protein folds the current method outperformed in a ∼5:1 ratio an earlier developed database search method.
format Text
id pubmed-1440879
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-14408792006-04-20 A supersecondary structure library and search algorithm for modeling loops in protein structures Fernandez-Fuentes, Narcis Oliva, Baldomero Fiser, András Nucleic Acids Res Article We present a fragment-search based method for predicting loop conformations in protein models. A hierarchical and multidimensional database has been set up that currently classifies 105 950 loop fragments and loop flanking secondary structures. Besides the length of the loops and types of bracing secondary structures the database is organized along four internal coordinates, a distance and three types of angles characterizing the geometry of stem regions. Candidate fragments are selected from this library by matching the length, the types of bracing secondary structures of the query and satisfying the geometrical restraints of the stems and subsequently inserted in the query protein framework where their fit is assessed by the root mean square deviation (r.m.s.d.) of stem regions and by the number of rigid body clashes with the environment. In the final step remaining candidate loops are ranked by a Z-score that combines information on sequence similarity and fit of predicted and observed ϕ/ψ main chain dihedral angle propensities. Confidence Z-score cut-offs were determined for each loop length that identify those predicted fragments that outperform a competitive ab initio method. A web server implements the method, regularly updates the fragment library and performs prediction. Predicted segments are returned, or optionally, these can be completed with side chain reconstruction and subsequently annealed in the environment of the query protein by conjugate gradient minimization. The prediction method was tested on artificially prepared search datasets where all trivial sequence similarities on the SCOP superfamily level were removed. Under these conditions it is possible to predict loops of length 4, 8 and 12 with coverage of 98, 78 and 28% with at least of 0.22, 1.38 and 2.47 Å of r.m.s.d. accuracy, respectively. In a head-to-head comparison on loops extracted from freshly deposited new protein folds the current method outperformed in a ∼5:1 ratio an earlier developed database search method. Oxford University Press 2006 2006-04-14 /pmc/articles/PMC1440879/ /pubmed/16617149 http://dx.doi.org/10.1093/nar/gkl156 Text en © The Author 2006. Published by Oxford University Press. All rights reserved
spellingShingle Article
Fernandez-Fuentes, Narcis
Oliva, Baldomero
Fiser, András
A supersecondary structure library and search algorithm for modeling loops in protein structures
title A supersecondary structure library and search algorithm for modeling loops in protein structures
title_full A supersecondary structure library and search algorithm for modeling loops in protein structures
title_fullStr A supersecondary structure library and search algorithm for modeling loops in protein structures
title_full_unstemmed A supersecondary structure library and search algorithm for modeling loops in protein structures
title_short A supersecondary structure library and search algorithm for modeling loops in protein structures
title_sort supersecondary structure library and search algorithm for modeling loops in protein structures
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1440879/
https://www.ncbi.nlm.nih.gov/pubmed/16617149
http://dx.doi.org/10.1093/nar/gkl156
work_keys_str_mv AT fernandezfuentesnarcis asupersecondarystructurelibraryandsearchalgorithmformodelingloopsinproteinstructures
AT olivabaldomero asupersecondarystructurelibraryandsearchalgorithmformodelingloopsinproteinstructures
AT fiserandras asupersecondarystructurelibraryandsearchalgorithmformodelingloopsinproteinstructures
AT fernandezfuentesnarcis supersecondarystructurelibraryandsearchalgorithmformodelingloopsinproteinstructures
AT olivabaldomero supersecondarystructurelibraryandsearchalgorithmformodelingloopsinproteinstructures
AT fiserandras supersecondarystructurelibraryandsearchalgorithmformodelingloopsinproteinstructures