Cargando…

Machine learning differentiates enzymatic and non-enzymatic metals in proteins

Metalloenzymes are 40% of all enzymes and can perform all seven classes of enzyme reactions. Because of the physicochemical similarities between the active sites of metalloenzymes and inactive metal binding sites, it is challenging to differentiate between them. Yet distinguishing these two classes...

Descripción completa

Detalles Bibliográficos
Autores principales: Feehan, Ryan, Franklin, Meghan W., Slusky, Joanna S. G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211803/
https://www.ncbi.nlm.nih.gov/pubmed/34140507
http://dx.doi.org/10.1038/s41467-021-24070-3
_version_ 1783709545250422784
author Feehan, Ryan
Franklin, Meghan W.
Slusky, Joanna S. G.
author_facet Feehan, Ryan
Franklin, Meghan W.
Slusky, Joanna S. G.
author_sort Feehan, Ryan
collection PubMed
description Metalloenzymes are 40% of all enzymes and can perform all seven classes of enzyme reactions. Because of the physicochemical similarities between the active sites of metalloenzymes and inactive metal binding sites, it is challenging to differentiate between them. Yet distinguishing these two classes is critical for the identification of both native and designed enzymes. Because of similarities between catalytic and non-catalytic  metal binding sites, finding physicochemical features that distinguish these two types of metal sites can indicate aspects that are critical to enzyme function. In this work, we develop the largest structural dataset of enzymatic and non-enzymatic metalloprotein sites to date. We then use a decision-tree ensemble machine learning model to classify metals bound to proteins as enzymatic or non-enzymatic with 92.2% precision and 90.1% recall. Our model scores electrostatic and pocket lining features as more important than pocket volume, despite the fact that volume is the most quantitatively different feature between enzyme and non-enzymatic sites. Finally, we find our model has overall better performance in a side-to-side comparison against other methods that differentiate enzymatic from non-enzymatic sequences. We anticipate that our model’s ability to correctly identify which metal sites are responsible for enzymatic activity could enable identification of new enzymatic mechanisms and de novo enzyme design.
format Online
Article
Text
id pubmed-8211803
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82118032021-07-01 Machine learning differentiates enzymatic and non-enzymatic metals in proteins Feehan, Ryan Franklin, Meghan W. Slusky, Joanna S. G. Nat Commun Article Metalloenzymes are 40% of all enzymes and can perform all seven classes of enzyme reactions. Because of the physicochemical similarities between the active sites of metalloenzymes and inactive metal binding sites, it is challenging to differentiate between them. Yet distinguishing these two classes is critical for the identification of both native and designed enzymes. Because of similarities between catalytic and non-catalytic  metal binding sites, finding physicochemical features that distinguish these two types of metal sites can indicate aspects that are critical to enzyme function. In this work, we develop the largest structural dataset of enzymatic and non-enzymatic metalloprotein sites to date. We then use a decision-tree ensemble machine learning model to classify metals bound to proteins as enzymatic or non-enzymatic with 92.2% precision and 90.1% recall. Our model scores electrostatic and pocket lining features as more important than pocket volume, despite the fact that volume is the most quantitatively different feature between enzyme and non-enzymatic sites. Finally, we find our model has overall better performance in a side-to-side comparison against other methods that differentiate enzymatic from non-enzymatic sequences. We anticipate that our model’s ability to correctly identify which metal sites are responsible for enzymatic activity could enable identification of new enzymatic mechanisms and de novo enzyme design. Nature Publishing Group UK 2021-06-17 /pmc/articles/PMC8211803/ /pubmed/34140507 http://dx.doi.org/10.1038/s41467-021-24070-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Feehan, Ryan
Franklin, Meghan W.
Slusky, Joanna S. G.
Machine learning differentiates enzymatic and non-enzymatic metals in proteins
title Machine learning differentiates enzymatic and non-enzymatic metals in proteins
title_full Machine learning differentiates enzymatic and non-enzymatic metals in proteins
title_fullStr Machine learning differentiates enzymatic and non-enzymatic metals in proteins
title_full_unstemmed Machine learning differentiates enzymatic and non-enzymatic metals in proteins
title_short Machine learning differentiates enzymatic and non-enzymatic metals in proteins
title_sort machine learning differentiates enzymatic and non-enzymatic metals in proteins
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8211803/
https://www.ncbi.nlm.nih.gov/pubmed/34140507
http://dx.doi.org/10.1038/s41467-021-24070-3
work_keys_str_mv AT feehanryan machinelearningdifferentiatesenzymaticandnonenzymaticmetalsinproteins
AT franklinmeghanw machinelearningdifferentiatesenzymaticandnonenzymaticmetalsinproteins
AT sluskyjoannasg machinelearningdifferentiatesenzymaticandnonenzymaticmetalsinproteins