Cargando…

SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability

Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learni...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Gen, Panday, Shailesh Kumar, Alexov, Emil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7827184/
https://www.ncbi.nlm.nih.gov/pubmed/33435356
http://dx.doi.org/10.3390/ijms22020606
_version_ 1783640700902965248
author Li, Gen
Panday, Shailesh Kumar
Alexov, Emil
author_facet Li, Gen
Panday, Shailesh Kumar
Alexov, Emil
author_sort Li, Gen
collection PubMed
description Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learning method to predict the change of the folding free energy caused by amino acid substitutions. The method does not require the 3D structure of the corresponding protein, but only its sequence and, thus, can be applied on genome-scale investigations where structural information is very sparse. SAAFEC-SEQ uses physicochemical properties, sequence features, and evolutionary information features to make the predictions. It is shown to consistently outperform all existing state-of-the-art sequence-based methods in both the Pearson correlation coefficient and root-mean-squared-error parameters as benchmarked on several independent datasets. The SAAFEC-SEQ has been implemented into a web server and is available as stand-alone code that can be downloaded and embedded into other researchers’ code.
format Online
Article
Text
id pubmed-7827184
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-78271842021-01-25 SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability Li, Gen Panday, Shailesh Kumar Alexov, Emil Int J Mol Sci Article Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learning method to predict the change of the folding free energy caused by amino acid substitutions. The method does not require the 3D structure of the corresponding protein, but only its sequence and, thus, can be applied on genome-scale investigations where structural information is very sparse. SAAFEC-SEQ uses physicochemical properties, sequence features, and evolutionary information features to make the predictions. It is shown to consistently outperform all existing state-of-the-art sequence-based methods in both the Pearson correlation coefficient and root-mean-squared-error parameters as benchmarked on several independent datasets. The SAAFEC-SEQ has been implemented into a web server and is available as stand-alone code that can be downloaded and embedded into other researchers’ code. MDPI 2021-01-09 /pmc/articles/PMC7827184/ /pubmed/33435356 http://dx.doi.org/10.3390/ijms22020606 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Gen
Panday, Shailesh Kumar
Alexov, Emil
SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability
title SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability
title_full SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability
title_fullStr SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability
title_full_unstemmed SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability
title_short SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability
title_sort saafec-seq: a sequence-based method for predicting the effect of single point mutations on protein thermodynamic stability
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7827184/
https://www.ncbi.nlm.nih.gov/pubmed/33435356
http://dx.doi.org/10.3390/ijms22020606
work_keys_str_mv AT ligen saafecseqasequencebasedmethodforpredictingtheeffectofsinglepointmutationsonproteinthermodynamicstability
AT pandayshaileshkumar saafecseqasequencebasedmethodforpredictingtheeffectofsinglepointmutationsonproteinthermodynamicstability
AT alexovemil saafecseqasequencebasedmethodforpredictingtheeffectofsinglepointmutationsonproteinthermodynamicstability