Cargando…

MAHOMES II: A webserver for predicting if a metal binding site is enzymatic

Recent advances have enabled high-quality computationally generated structures for proteins with no solved crystal structures. However, protein function data remains largely limited to experimental methods and homology mapping. Since structure determines function, it is natural that methods capable...

Descripción completa

Detalles Bibliográficos
Autores principales: Feehan, Ryan, Copeland, Matthew, Franklin, Meghan W., Slusky, Joanna S. G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028950/
https://www.ncbi.nlm.nih.gov/pubmed/36945603
http://dx.doi.org/10.1101/2023.03.08.531790
_version_ 1784910049082081280
author Feehan, Ryan
Copeland, Matthew
Franklin, Meghan W.
Slusky, Joanna S. G.
author_facet Feehan, Ryan
Copeland, Matthew
Franklin, Meghan W.
Slusky, Joanna S. G.
author_sort Feehan, Ryan
collection PubMed
description Recent advances have enabled high-quality computationally generated structures for proteins with no solved crystal structures. However, protein function data remains largely limited to experimental methods and homology mapping. Since structure determines function, it is natural that methods capable of using computationally generated structures for functional annotations need to be advanced. Our laboratory recently developed a method to distinguish between metalloenzyme and non-enzyme sites. Here we report improvements to this method by upgrading our physicochemical features to alleviate the need for structures with sub-angstrom precision and using machine learning to reduce training data labeling error. Our improved classifier identifies protein bound metal sites as enzymatic or non-enzymatic with 94% precision and 92% recall. We demonstrate that both adjustments increased predictive performance and reliability on sites with sub-angstrom variations. We constructed a set of predicted metalloprotein structures with no solved crystal structures and no detectable homology to our training data. Our model had an accuracy of 90 – 97.5% depending on the quality of the predicted structures included in our test. Finally, we found the physicochemical trends that drove this model’s successful performance were local protein density, second shell ionizable residue burial, and the pocket’s accessibility to the site. We anticipate that our model’s ability to correctly identify catalytic metal sites could enable identification of new enzymatic mechanisms and improve de novo metalloenzyme design success rates.
format Online
Article
Text
id pubmed-10028950
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100289502023-03-22 MAHOMES II: A webserver for predicting if a metal binding site is enzymatic Feehan, Ryan Copeland, Matthew Franklin, Meghan W. Slusky, Joanna S. G. bioRxiv Article Recent advances have enabled high-quality computationally generated structures for proteins with no solved crystal structures. However, protein function data remains largely limited to experimental methods and homology mapping. Since structure determines function, it is natural that methods capable of using computationally generated structures for functional annotations need to be advanced. Our laboratory recently developed a method to distinguish between metalloenzyme and non-enzyme sites. Here we report improvements to this method by upgrading our physicochemical features to alleviate the need for structures with sub-angstrom precision and using machine learning to reduce training data labeling error. Our improved classifier identifies protein bound metal sites as enzymatic or non-enzymatic with 94% precision and 92% recall. We demonstrate that both adjustments increased predictive performance and reliability on sites with sub-angstrom variations. We constructed a set of predicted metalloprotein structures with no solved crystal structures and no detectable homology to our training data. Our model had an accuracy of 90 – 97.5% depending on the quality of the predicted structures included in our test. Finally, we found the physicochemical trends that drove this model’s successful performance were local protein density, second shell ionizable residue burial, and the pocket’s accessibility to the site. We anticipate that our model’s ability to correctly identify catalytic metal sites could enable identification of new enzymatic mechanisms and improve de novo metalloenzyme design success rates. Cold Spring Harbor Laboratory 2023-03-12 /pmc/articles/PMC10028950/ /pubmed/36945603 http://dx.doi.org/10.1101/2023.03.08.531790 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Feehan, Ryan
Copeland, Matthew
Franklin, Meghan W.
Slusky, Joanna S. G.
MAHOMES II: A webserver for predicting if a metal binding site is enzymatic
title MAHOMES II: A webserver for predicting if a metal binding site is enzymatic
title_full MAHOMES II: A webserver for predicting if a metal binding site is enzymatic
title_fullStr MAHOMES II: A webserver for predicting if a metal binding site is enzymatic
title_full_unstemmed MAHOMES II: A webserver for predicting if a metal binding site is enzymatic
title_short MAHOMES II: A webserver for predicting if a metal binding site is enzymatic
title_sort mahomes ii: a webserver for predicting if a metal binding site is enzymatic
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028950/
https://www.ncbi.nlm.nih.gov/pubmed/36945603
http://dx.doi.org/10.1101/2023.03.08.531790
work_keys_str_mv AT feehanryan mahomesiiawebserverforpredictingifametalbindingsiteisenzymatic
AT copelandmatthew mahomesiiawebserverforpredictingifametalbindingsiteisenzymatic
AT franklinmeghanw mahomesiiawebserverforpredictingifametalbindingsiteisenzymatic
AT sluskyjoannasg mahomesiiawebserverforpredictingifametalbindingsiteisenzymatic