Cargando…
A comprehensive analysis of 40 blind protein structure predictions
BACKGROUND: We thoroughly analyse the results of 40 blind predictions for which an experimental answer was made available at the fourth meeting on the critical assessment of protein structure methods (CASP4). Using our comparative modelling and fold recognition methodologies, we made 29 predictions...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2002
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC122083/ https://www.ncbi.nlm.nih.gov/pubmed/12150712 http://dx.doi.org/10.1186/1472-6807-2-3 |
_version_ | 1782120305586601984 |
---|---|
author | Samudrala, Ram Levitt, Michael |
author_facet | Samudrala, Ram Levitt, Michael |
author_sort | Samudrala, Ram |
collection | PubMed |
description | BACKGROUND: We thoroughly analyse the results of 40 blind predictions for which an experimental answer was made available at the fourth meeting on the critical assessment of protein structure methods (CASP4). Using our comparative modelling and fold recognition methodologies, we made 29 predictions for targets that had sequence identities ranging from 50% to 10% to the nearest related protein with known structure. Using our ab initio methodologies, we made eleven predictions for targets that had no detectable sequence relationships. RESULTS: For 23 of these proteins, we produced models ranging from 1.0 to 6.0 Å root mean square deviation (RMSD) for the C(α) atoms between the model and the corresponding experimental structure for all or large parts of the protein, with model accuracies scaling fairly linearly with respect to sequence identity (i.e., the higher the sequence identity, the better the prediction). We produced nine models with accuracies ranging from 4.0 to 6.0 Å C(α) RMSD for 60–100 residue proteins (or large fragments of a protein), with a prediction accuracy of 4.0 Å C(α) RMSD for residues 1–80 for T110/rbfa. CONCLUSIONS: The areas of protein structure prediction that work well, and areas that need improvement, are discernable by examining how our methods have performed over the past four CASP experiments. These results have implications for modelling the structure of all tractable proteins encoded by the genome of an organism. |
format | Text |
id | pubmed-122083 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2002 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1220832002-09-10 A comprehensive analysis of 40 blind protein structure predictions Samudrala, Ram Levitt, Michael BMC Struct Biol Research Article BACKGROUND: We thoroughly analyse the results of 40 blind predictions for which an experimental answer was made available at the fourth meeting on the critical assessment of protein structure methods (CASP4). Using our comparative modelling and fold recognition methodologies, we made 29 predictions for targets that had sequence identities ranging from 50% to 10% to the nearest related protein with known structure. Using our ab initio methodologies, we made eleven predictions for targets that had no detectable sequence relationships. RESULTS: For 23 of these proteins, we produced models ranging from 1.0 to 6.0 Å root mean square deviation (RMSD) for the C(α) atoms between the model and the corresponding experimental structure for all or large parts of the protein, with model accuracies scaling fairly linearly with respect to sequence identity (i.e., the higher the sequence identity, the better the prediction). We produced nine models with accuracies ranging from 4.0 to 6.0 Å C(α) RMSD for 60–100 residue proteins (or large fragments of a protein), with a prediction accuracy of 4.0 Å C(α) RMSD for residues 1–80 for T110/rbfa. CONCLUSIONS: The areas of protein structure prediction that work well, and areas that need improvement, are discernable by examining how our methods have performed over the past four CASP experiments. These results have implications for modelling the structure of all tractable proteins encoded by the genome of an organism. BioMed Central 2002-08-01 /pmc/articles/PMC122083/ /pubmed/12150712 http://dx.doi.org/10.1186/1472-6807-2-3 Text en Copyright © 2002 Samudrala and Levitt; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Research Article Samudrala, Ram Levitt, Michael A comprehensive analysis of 40 blind protein structure predictions |
title | A comprehensive analysis of 40 blind protein structure predictions |
title_full | A comprehensive analysis of 40 blind protein structure predictions |
title_fullStr | A comprehensive analysis of 40 blind protein structure predictions |
title_full_unstemmed | A comprehensive analysis of 40 blind protein structure predictions |
title_short | A comprehensive analysis of 40 blind protein structure predictions |
title_sort | comprehensive analysis of 40 blind protein structure predictions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC122083/ https://www.ncbi.nlm.nih.gov/pubmed/12150712 http://dx.doi.org/10.1186/1472-6807-2-3 |
work_keys_str_mv | AT samudralaram acomprehensiveanalysisof40blindproteinstructurepredictions AT levittmichael acomprehensiveanalysisof40blindproteinstructurepredictions AT samudralaram comprehensiveanalysisof40blindproteinstructurepredictions AT levittmichael comprehensiveanalysisof40blindproteinstructurepredictions |