Cargando…

Role of simple descriptors and applicability domain in predicting change in protein thermostability

The melting temperature (Tm) of a protein is the temperature at which half of the protein population is in a folded state. Therefore, Tm is a measure of the thermostability of a protein. Increasing the Tm of a protein is a critical goal in biotechnology and biomedicine. However, predicting the chang...

Descripción completa

Detalles Bibliográficos
Autores principales: McGuinness, Kenneth N., Pan, Weilan, Sheridan, Robert P., Murphy, Grant, Crespo, Alejandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6128648/
https://www.ncbi.nlm.nih.gov/pubmed/30192891
http://dx.doi.org/10.1371/journal.pone.0203819
_version_ 1783353684341555200
author McGuinness, Kenneth N.
Pan, Weilan
Sheridan, Robert P.
Murphy, Grant
Crespo, Alejandro
author_facet McGuinness, Kenneth N.
Pan, Weilan
Sheridan, Robert P.
Murphy, Grant
Crespo, Alejandro
author_sort McGuinness, Kenneth N.
collection PubMed
description The melting temperature (Tm) of a protein is the temperature at which half of the protein population is in a folded state. Therefore, Tm is a measure of the thermostability of a protein. Increasing the Tm of a protein is a critical goal in biotechnology and biomedicine. However, predicting the change in melting temperature (dTm) due to mutations at a single residue is difficult because it depends on an intricate balance of forces. Existing methods for predicting dTm have had similar levels of success using generally complex models. We find that training a machine learning model with a simple set of easy to calculate physicochemical descriptors describing the local environment of the mutation performed as well as more complicated machine learning models and is 2–6 orders of magnitude faster. Importantly, unlike in most previous publications, we perform a blind prospective test on our simple model by designing 96 variants of a protein not in the training set. Results from retrospective and prospective predictions reveal the limited applicability domain of each model. This study highlights the current deficiencies in the available dTm dataset and is a call to the community to systematically design a larger and more diverse experimental dataset of mutants to prospectively predict dTm with greater certainty.
format Online
Article
Text
id pubmed-6128648
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61286482018-09-15 Role of simple descriptors and applicability domain in predicting change in protein thermostability McGuinness, Kenneth N. Pan, Weilan Sheridan, Robert P. Murphy, Grant Crespo, Alejandro PLoS One Research Article The melting temperature (Tm) of a protein is the temperature at which half of the protein population is in a folded state. Therefore, Tm is a measure of the thermostability of a protein. Increasing the Tm of a protein is a critical goal in biotechnology and biomedicine. However, predicting the change in melting temperature (dTm) due to mutations at a single residue is difficult because it depends on an intricate balance of forces. Existing methods for predicting dTm have had similar levels of success using generally complex models. We find that training a machine learning model with a simple set of easy to calculate physicochemical descriptors describing the local environment of the mutation performed as well as more complicated machine learning models and is 2–6 orders of magnitude faster. Importantly, unlike in most previous publications, we perform a blind prospective test on our simple model by designing 96 variants of a protein not in the training set. Results from retrospective and prospective predictions reveal the limited applicability domain of each model. This study highlights the current deficiencies in the available dTm dataset and is a call to the community to systematically design a larger and more diverse experimental dataset of mutants to prospectively predict dTm with greater certainty. Public Library of Science 2018-09-07 /pmc/articles/PMC6128648/ /pubmed/30192891 http://dx.doi.org/10.1371/journal.pone.0203819 Text en © 2018 McGuinness et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
McGuinness, Kenneth N.
Pan, Weilan
Sheridan, Robert P.
Murphy, Grant
Crespo, Alejandro
Role of simple descriptors and applicability domain in predicting change in protein thermostability
title Role of simple descriptors and applicability domain in predicting change in protein thermostability
title_full Role of simple descriptors and applicability domain in predicting change in protein thermostability
title_fullStr Role of simple descriptors and applicability domain in predicting change in protein thermostability
title_full_unstemmed Role of simple descriptors and applicability domain in predicting change in protein thermostability
title_short Role of simple descriptors and applicability domain in predicting change in protein thermostability
title_sort role of simple descriptors and applicability domain in predicting change in protein thermostability
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6128648/
https://www.ncbi.nlm.nih.gov/pubmed/30192891
http://dx.doi.org/10.1371/journal.pone.0203819
work_keys_str_mv AT mcguinnesskennethn roleofsimpledescriptorsandapplicabilitydomaininpredictingchangeinproteinthermostability
AT panweilan roleofsimpledescriptorsandapplicabilitydomaininpredictingchangeinproteinthermostability
AT sheridanrobertp roleofsimpledescriptorsandapplicabilitydomaininpredictingchangeinproteinthermostability
AT murphygrant roleofsimpledescriptorsandapplicabilitydomaininpredictingchangeinproteinthermostability
AT crespoalejandro roleofsimpledescriptorsandapplicabilitydomaininpredictingchangeinproteinthermostability