Cargando…

A base measure of precision for protein stability predictors: structural sensitivity

BACKGROUND: Prediction of the change in fold stability (ΔΔG) of a protein upon mutation is of major importance to protein engineering and screening of disease-causing variants. Many prediction methods can use 3D structural information to predict ΔΔG. While the performance of these methods has been e...

Descripción completa

Detalles Bibliográficos
Autores principales: Caldararu, Octav, Blundell, Tom L., Kepp, Kasper P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7908712/
https://www.ncbi.nlm.nih.gov/pubmed/33632133
http://dx.doi.org/10.1186/s12859-021-04030-w
_version_ 1783655775720177664
author Caldararu, Octav
Blundell, Tom L.
Kepp, Kasper P.
author_facet Caldararu, Octav
Blundell, Tom L.
Kepp, Kasper P.
author_sort Caldararu, Octav
collection PubMed
description BACKGROUND: Prediction of the change in fold stability (ΔΔG) of a protein upon mutation is of major importance to protein engineering and screening of disease-causing variants. Many prediction methods can use 3D structural information to predict ΔΔG. While the performance of these methods has been extensively studied, a new problem has arisen due to the abundance of crystal structures: How precise are these methods in terms of structure input used, which structure should be used, and how much does it matter? Thus, there is a need to quantify the structural sensitivity of protein stability prediction methods. RESULTS: We computed the structural sensitivity of six widely-used prediction methods by use of saturated computational mutagenesis on a diverse set of 87 structures of 25 proteins. Our results show that structural sensitivity varies massively and surprisingly falls into two very distinct groups, with methods that take detailed account of the local environment showing a sensitivity of ~ 0.6 to 0.8 kcal/mol, whereas machine-learning methods display much lower sensitivity (~ 0.1 kcal/mol). We also observe that the precision correlates with the accuracy for mutation-type-balanced data sets but not generally reported accuracy of the methods, indicating the importance of mutation-type balance in both contexts. CONCLUSIONS: The structural sensitivity of stability prediction methods varies greatly and is caused mainly by the models and less by the actual protein structural differences. As a new recommended standard, we therefore suggest that ΔΔG values are evaluated on three protein structures when available and the associated standard deviation reported, to emphasize not just the accuracy but also the precision of the method in a specific study. Our observation that machine-learning methods deemphasize structure may indicate that folded wild-type structures alone, without the folded mutant and unfolded structures, only add modest value for assessing protein stability effects, and that side-chain-sensitive methods overstate the significance of the folded wild-type structure.
format Online
Article
Text
id pubmed-7908712
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79087122021-02-26 A base measure of precision for protein stability predictors: structural sensitivity Caldararu, Octav Blundell, Tom L. Kepp, Kasper P. BMC Bioinformatics Research Article BACKGROUND: Prediction of the change in fold stability (ΔΔG) of a protein upon mutation is of major importance to protein engineering and screening of disease-causing variants. Many prediction methods can use 3D structural information to predict ΔΔG. While the performance of these methods has been extensively studied, a new problem has arisen due to the abundance of crystal structures: How precise are these methods in terms of structure input used, which structure should be used, and how much does it matter? Thus, there is a need to quantify the structural sensitivity of protein stability prediction methods. RESULTS: We computed the structural sensitivity of six widely-used prediction methods by use of saturated computational mutagenesis on a diverse set of 87 structures of 25 proteins. Our results show that structural sensitivity varies massively and surprisingly falls into two very distinct groups, with methods that take detailed account of the local environment showing a sensitivity of ~ 0.6 to 0.8 kcal/mol, whereas machine-learning methods display much lower sensitivity (~ 0.1 kcal/mol). We also observe that the precision correlates with the accuracy for mutation-type-balanced data sets but not generally reported accuracy of the methods, indicating the importance of mutation-type balance in both contexts. CONCLUSIONS: The structural sensitivity of stability prediction methods varies greatly and is caused mainly by the models and less by the actual protein structural differences. As a new recommended standard, we therefore suggest that ΔΔG values are evaluated on three protein structures when available and the associated standard deviation reported, to emphasize not just the accuracy but also the precision of the method in a specific study. Our observation that machine-learning methods deemphasize structure may indicate that folded wild-type structures alone, without the folded mutant and unfolded structures, only add modest value for assessing protein stability effects, and that side-chain-sensitive methods overstate the significance of the folded wild-type structure. BioMed Central 2021-02-25 /pmc/articles/PMC7908712/ /pubmed/33632133 http://dx.doi.org/10.1186/s12859-021-04030-w Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Caldararu, Octav
Blundell, Tom L.
Kepp, Kasper P.
A base measure of precision for protein stability predictors: structural sensitivity
title A base measure of precision for protein stability predictors: structural sensitivity
title_full A base measure of precision for protein stability predictors: structural sensitivity
title_fullStr A base measure of precision for protein stability predictors: structural sensitivity
title_full_unstemmed A base measure of precision for protein stability predictors: structural sensitivity
title_short A base measure of precision for protein stability predictors: structural sensitivity
title_sort base measure of precision for protein stability predictors: structural sensitivity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7908712/
https://www.ncbi.nlm.nih.gov/pubmed/33632133
http://dx.doi.org/10.1186/s12859-021-04030-w
work_keys_str_mv AT caldararuoctav abasemeasureofprecisionforproteinstabilitypredictorsstructuralsensitivity
AT blundelltoml abasemeasureofprecisionforproteinstabilitypredictorsstructuralsensitivity
AT keppkasperp abasemeasureofprecisionforproteinstabilitypredictorsstructuralsensitivity
AT caldararuoctav basemeasureofprecisionforproteinstabilitypredictorsstructuralsensitivity
AT blundelltoml basemeasureofprecisionforproteinstabilitypredictorsstructuralsensitivity
AT keppkasperp basemeasureofprecisionforproteinstabilitypredictorsstructuralsensitivity