Cargando…

DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations

BACKGROUND: Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (∆∆G) b...

Descripción completa

Detalles Bibliográficos
Autores principales: Montanucci, Ludovica, Capriotti, Emidio, Frank, Yotam, Ben-Tal, Nir, Fariselli, Piero
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6606456/
https://www.ncbi.nlm.nih.gov/pubmed/31266447
http://dx.doi.org/10.1186/s12859-019-2923-1
_version_ 1783431911918534656
author Montanucci, Ludovica
Capriotti, Emidio
Frank, Yotam
Ben-Tal, Nir
Fariselli, Piero
author_facet Montanucci, Ludovica
Capriotti, Emidio
Frank, Yotam
Ben-Tal, Nir
Fariselli, Piero
author_sort Montanucci, Ludovica
collection PubMed
description BACKGROUND: Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (∆∆G) between wild type and variant proteins, using sequence and structure information. Most of the available methods however do not exhibit the anti-symmetric prediction property, which guarantees that the predicted ∆∆G value for a variation is the exact opposite of that predicted for the reverse variation, i.e., ∆∆G(A → B) = −∆∆G(B → A), where A and B are amino acids. RESULTS: Here we introduce simple anti-symmetric features, based on evolutionary information, which are combined to define an untrained method, DDGun (DDG untrained). DDGun is a simple approach based on evolutionary information that predicts the ∆∆G for single and multiple variations from sequence and structure information (DDGun3D). Our method achieves remarkable performance without any training on the experimental datasets, reaching Pearson correlation coefficients between predicted and measured ∆∆G values of ~ 0.5 and ~ 0.4 for single and multiple site variations, respectively. Surprisingly, DDGun performances are comparable with those of state of the art methods. DDGun also naturally predicts multiple site variations, thereby defining a benchmark method for both single site and multiple site predictors. DDGun is anti-symmetric by construction predicting the value of the ∆∆G of a reciprocal variation as almost equal (depending on the sequence profile) to -∆∆G of the direct variation. This is a valuable property that is missing in the majority of the methods. CONCLUSIONS: Evolutionary information alone combined in an untrained method can achieve remarkably high performances in the prediction of ∆∆G upon protein mutation. Non-trained approaches like DDGun represent a valid benchmark both for scoring the predictive power of the individual features and for assessing the learning capability of supervised methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2923-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6606456
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66064562019-07-12 DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations Montanucci, Ludovica Capriotti, Emidio Frank, Yotam Ben-Tal, Nir Fariselli, Piero BMC Bioinformatics Research BACKGROUND: Predicting the effect of single point variations on protein stability constitutes a crucial step toward understanding the relationship between protein structure and function. To this end, several methods have been developed to predict changes in the Gibbs free energy of unfolding (∆∆G) between wild type and variant proteins, using sequence and structure information. Most of the available methods however do not exhibit the anti-symmetric prediction property, which guarantees that the predicted ∆∆G value for a variation is the exact opposite of that predicted for the reverse variation, i.e., ∆∆G(A → B) = −∆∆G(B → A), where A and B are amino acids. RESULTS: Here we introduce simple anti-symmetric features, based on evolutionary information, which are combined to define an untrained method, DDGun (DDG untrained). DDGun is a simple approach based on evolutionary information that predicts the ∆∆G for single and multiple variations from sequence and structure information (DDGun3D). Our method achieves remarkable performance without any training on the experimental datasets, reaching Pearson correlation coefficients between predicted and measured ∆∆G values of ~ 0.5 and ~ 0.4 for single and multiple site variations, respectively. Surprisingly, DDGun performances are comparable with those of state of the art methods. DDGun also naturally predicts multiple site variations, thereby defining a benchmark method for both single site and multiple site predictors. DDGun is anti-symmetric by construction predicting the value of the ∆∆G of a reciprocal variation as almost equal (depending on the sequence profile) to -∆∆G of the direct variation. This is a valuable property that is missing in the majority of the methods. CONCLUSIONS: Evolutionary information alone combined in an untrained method can achieve remarkably high performances in the prediction of ∆∆G upon protein mutation. Non-trained approaches like DDGun represent a valid benchmark both for scoring the predictive power of the individual features and for assessing the learning capability of supervised methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2923-1) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-03 /pmc/articles/PMC6606456/ /pubmed/31266447 http://dx.doi.org/10.1186/s12859-019-2923-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Montanucci, Ludovica
Capriotti, Emidio
Frank, Yotam
Ben-Tal, Nir
Fariselli, Piero
DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
title DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
title_full DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
title_fullStr DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
title_full_unstemmed DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
title_short DDGun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
title_sort ddgun: an untrained method for the prediction of protein stability changes upon single and multiple point variations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6606456/
https://www.ncbi.nlm.nih.gov/pubmed/31266447
http://dx.doi.org/10.1186/s12859-019-2923-1
work_keys_str_mv AT montanucciludovica ddgunanuntrainedmethodforthepredictionofproteinstabilitychangesuponsingleandmultiplepointvariations
AT capriottiemidio ddgunanuntrainedmethodforthepredictionofproteinstabilitychangesuponsingleandmultiplepointvariations
AT frankyotam ddgunanuntrainedmethodforthepredictionofproteinstabilitychangesuponsingleandmultiplepointvariations
AT bentalnir ddgunanuntrainedmethodforthepredictionofproteinstabilitychangesuponsingleandmultiplepointvariations
AT farisellipiero ddgunanuntrainedmethodforthepredictionofproteinstabilitychangesuponsingleandmultiplepointvariations