Cargando…

A comprehensive analysis of 40 blind protein structure predictions

BACKGROUND: We thoroughly analyse the results of 40 blind predictions for which an experimental answer was made available at the fourth meeting on the critical assessment of protein structure methods (CASP4). Using our comparative modelling and fold recognition methodologies, we made 29 predictions...

Descripción completa

Detalles Bibliográficos
Autores principales: Samudrala, Ram, Levitt, Michael
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC122083/
https://www.ncbi.nlm.nih.gov/pubmed/12150712
http://dx.doi.org/10.1186/1472-6807-2-3
_version_ 1782120305586601984
author Samudrala, Ram
Levitt, Michael
author_facet Samudrala, Ram
Levitt, Michael
author_sort Samudrala, Ram
collection PubMed
description BACKGROUND: We thoroughly analyse the results of 40 blind predictions for which an experimental answer was made available at the fourth meeting on the critical assessment of protein structure methods (CASP4). Using our comparative modelling and fold recognition methodologies, we made 29 predictions for targets that had sequence identities ranging from 50% to 10% to the nearest related protein with known structure. Using our ab initio methodologies, we made eleven predictions for targets that had no detectable sequence relationships. RESULTS: For 23 of these proteins, we produced models ranging from 1.0 to 6.0 Å root mean square deviation (RMSD) for the C(α) atoms between the model and the corresponding experimental structure for all or large parts of the protein, with model accuracies scaling fairly linearly with respect to sequence identity (i.e., the higher the sequence identity, the better the prediction). We produced nine models with accuracies ranging from 4.0 to 6.0 Å C(α) RMSD for 60–100 residue proteins (or large fragments of a protein), with a prediction accuracy of 4.0 Å C(α) RMSD for residues 1–80 for T110/rbfa. CONCLUSIONS: The areas of protein structure prediction that work well, and areas that need improvement, are discernable by examining how our methods have performed over the past four CASP experiments. These results have implications for modelling the structure of all tractable proteins encoded by the genome of an organism.
format Text
id pubmed-122083
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1220832002-09-10 A comprehensive analysis of 40 blind protein structure predictions Samudrala, Ram Levitt, Michael BMC Struct Biol Research Article BACKGROUND: We thoroughly analyse the results of 40 blind predictions for which an experimental answer was made available at the fourth meeting on the critical assessment of protein structure methods (CASP4). Using our comparative modelling and fold recognition methodologies, we made 29 predictions for targets that had sequence identities ranging from 50% to 10% to the nearest related protein with known structure. Using our ab initio methodologies, we made eleven predictions for targets that had no detectable sequence relationships. RESULTS: For 23 of these proteins, we produced models ranging from 1.0 to 6.0 Å root mean square deviation (RMSD) for the C(α) atoms between the model and the corresponding experimental structure for all or large parts of the protein, with model accuracies scaling fairly linearly with respect to sequence identity (i.e., the higher the sequence identity, the better the prediction). We produced nine models with accuracies ranging from 4.0 to 6.0 Å C(α) RMSD for 60–100 residue proteins (or large fragments of a protein), with a prediction accuracy of 4.0 Å C(α) RMSD for residues 1–80 for T110/rbfa. CONCLUSIONS: The areas of protein structure prediction that work well, and areas that need improvement, are discernable by examining how our methods have performed over the past four CASP experiments. These results have implications for modelling the structure of all tractable proteins encoded by the genome of an organism. BioMed Central 2002-08-01 /pmc/articles/PMC122083/ /pubmed/12150712 http://dx.doi.org/10.1186/1472-6807-2-3 Text en Copyright © 2002 Samudrala and Levitt; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research Article
Samudrala, Ram
Levitt, Michael
A comprehensive analysis of 40 blind protein structure predictions
title A comprehensive analysis of 40 blind protein structure predictions
title_full A comprehensive analysis of 40 blind protein structure predictions
title_fullStr A comprehensive analysis of 40 blind protein structure predictions
title_full_unstemmed A comprehensive analysis of 40 blind protein structure predictions
title_short A comprehensive analysis of 40 blind protein structure predictions
title_sort comprehensive analysis of 40 blind protein structure predictions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC122083/
https://www.ncbi.nlm.nih.gov/pubmed/12150712
http://dx.doi.org/10.1186/1472-6807-2-3
work_keys_str_mv AT samudralaram acomprehensiveanalysisof40blindproteinstructurepredictions
AT levittmichael acomprehensiveanalysisof40blindproteinstructurepredictions
AT samudralaram comprehensiveanalysisof40blindproteinstructurepredictions
AT levittmichael comprehensiveanalysisof40blindproteinstructurepredictions