Cargando…

MetaMQAP: A meta-server for the quality assessment of protein models

BACKGROUND: Computational models of protein structure are usually inaccurate and exhibit significant deviations from the true structure. The utility of models depends on the degree of these deviations. A number of predictive methods have been developed to discriminate between the globally incorrect...

Descripción completa

Detalles Bibliográficos
Autores principales: Pawlowski, Marcin, Gajda, Michal J, Matlak, Ryszard, Bujnicki, Janusz M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2573893/
https://www.ncbi.nlm.nih.gov/pubmed/18823532
http://dx.doi.org/10.1186/1471-2105-9-403
_version_ 1782160285239345152
author Pawlowski, Marcin
Gajda, Michal J
Matlak, Ryszard
Bujnicki, Janusz M
author_facet Pawlowski, Marcin
Gajda, Michal J
Matlak, Ryszard
Bujnicki, Janusz M
author_sort Pawlowski, Marcin
collection PubMed
description BACKGROUND: Computational models of protein structure are usually inaccurate and exhibit significant deviations from the true structure. The utility of models depends on the degree of these deviations. A number of predictive methods have been developed to discriminate between the globally incorrect and approximately correct models. However, only a few methods predict correctness of different parts of computational models. Several Model Quality Assessment Programs (MQAPs) have been developed to detect local inaccuracies in unrefined crystallographic models, but it is not known if they are useful for computational models, which usually exhibit different and much more severe errors. RESULTS: The ability to identify local errors in models was tested for eight MQAPs: VERIFY3D, PROSA, BALA, ANOLEA, PROVE, TUNE, REFINER, PROQRES on 8251 models from the CASP-5 and CASP-6 experiments, by calculating the Spearman's rank correlation coefficients between per-residue scores of these methods and local deviations between C-alpha atoms in the models vs. experimental structures. As a reference, we calculated the value of correlation between the local deviations and trivial features that can be calculated for each residue directly from the models, i.e. solvent accessibility, depth in the structure, and the number of local and non-local neighbours. We found that absolute correlations of scores returned by the MQAPs and local deviations were poor for all methods. In addition, scores of PROQRES and several other MQAPs strongly correlate with 'trivial' features. Therefore, we developed MetaMQAP, a meta-predictor based on a multivariate regression model, which uses scores of the above-mentioned methods, but in which trivial parameters are controlled. MetaMQAP predicts the absolute deviation (in Ångströms) of individual C-alpha atoms between the model and the unknown true structure as well as global deviations (expressed as root mean square deviation and GDT_TS scores). Local model accuracy predicted by MetaMQAP shows an impressive correlation coefficient of 0.7 with true deviations from native structures, a significant improvement over all constituent primary MQAP scores. The global MetaMQAP score is correlated with model GDT_TS on the level of 0.89. CONCLUSION: Finally, we compared our method with the MQAPs that scored best in the 7th edition of CASP, using CASP7 server models (not included in the MetaMQAP training set) as the test data. In our benchmark, MetaMQAP is outperformed only by PCONS6 and method QA_556 – methods that require comparison of multiple alternative models and score each of them depending on its similarity to other models. MetaMQAP is however the best among methods capable of evaluating just single models. We implemented the MetaMQAP as a web server available for free use by all academic users at the URL
format Text
id pubmed-2573893
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25738932008-10-27 MetaMQAP: A meta-server for the quality assessment of protein models Pawlowski, Marcin Gajda, Michal J Matlak, Ryszard Bujnicki, Janusz M BMC Bioinformatics Software BACKGROUND: Computational models of protein structure are usually inaccurate and exhibit significant deviations from the true structure. The utility of models depends on the degree of these deviations. A number of predictive methods have been developed to discriminate between the globally incorrect and approximately correct models. However, only a few methods predict correctness of different parts of computational models. Several Model Quality Assessment Programs (MQAPs) have been developed to detect local inaccuracies in unrefined crystallographic models, but it is not known if they are useful for computational models, which usually exhibit different and much more severe errors. RESULTS: The ability to identify local errors in models was tested for eight MQAPs: VERIFY3D, PROSA, BALA, ANOLEA, PROVE, TUNE, REFINER, PROQRES on 8251 models from the CASP-5 and CASP-6 experiments, by calculating the Spearman's rank correlation coefficients between per-residue scores of these methods and local deviations between C-alpha atoms in the models vs. experimental structures. As a reference, we calculated the value of correlation between the local deviations and trivial features that can be calculated for each residue directly from the models, i.e. solvent accessibility, depth in the structure, and the number of local and non-local neighbours. We found that absolute correlations of scores returned by the MQAPs and local deviations were poor for all methods. In addition, scores of PROQRES and several other MQAPs strongly correlate with 'trivial' features. Therefore, we developed MetaMQAP, a meta-predictor based on a multivariate regression model, which uses scores of the above-mentioned methods, but in which trivial parameters are controlled. MetaMQAP predicts the absolute deviation (in Ångströms) of individual C-alpha atoms between the model and the unknown true structure as well as global deviations (expressed as root mean square deviation and GDT_TS scores). Local model accuracy predicted by MetaMQAP shows an impressive correlation coefficient of 0.7 with true deviations from native structures, a significant improvement over all constituent primary MQAP scores. The global MetaMQAP score is correlated with model GDT_TS on the level of 0.89. CONCLUSION: Finally, we compared our method with the MQAPs that scored best in the 7th edition of CASP, using CASP7 server models (not included in the MetaMQAP training set) as the test data. In our benchmark, MetaMQAP is outperformed only by PCONS6 and method QA_556 – methods that require comparison of multiple alternative models and score each of them depending on its similarity to other models. MetaMQAP is however the best among methods capable of evaluating just single models. We implemented the MetaMQAP as a web server available for free use by all academic users at the URL BioMed Central 2008-09-29 /pmc/articles/PMC2573893/ /pubmed/18823532 http://dx.doi.org/10.1186/1471-2105-9-403 Text en Copyright © 2008 Pawlowski et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Pawlowski, Marcin
Gajda, Michal J
Matlak, Ryszard
Bujnicki, Janusz M
MetaMQAP: A meta-server for the quality assessment of protein models
title MetaMQAP: A meta-server for the quality assessment of protein models
title_full MetaMQAP: A meta-server for the quality assessment of protein models
title_fullStr MetaMQAP: A meta-server for the quality assessment of protein models
title_full_unstemmed MetaMQAP: A meta-server for the quality assessment of protein models
title_short MetaMQAP: A meta-server for the quality assessment of protein models
title_sort metamqap: a meta-server for the quality assessment of protein models
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2573893/
https://www.ncbi.nlm.nih.gov/pubmed/18823532
http://dx.doi.org/10.1186/1471-2105-9-403
work_keys_str_mv AT pawlowskimarcin metamqapametaserverforthequalityassessmentofproteinmodels
AT gajdamichalj metamqapametaserverforthequalityassessmentofproteinmodels
AT matlakryszard metamqapametaserverforthequalityassessmentofproteinmodels
AT bujnickijanuszm metamqapametaserverforthequalityassessmentofproteinmodels