Cargando…

EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases

When bioprospecting for novel industrial enzymes, substrate promiscuity is a desirable property that increases the reusability of the enzyme. Among industrial enzymes, ester hydrolases have great relevance for which the demand has not ceased to increase. However, the search for new substrate promisc...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiang, Ruite, Fernandez-Lopez, Laura, Robles-Martín, Ana, Ferrer, Manuel, Guallar, Victor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9599548/
https://www.ncbi.nlm.nih.gov/pubmed/36291739
http://dx.doi.org/10.3390/biom12101529
_version_ 1784816621484769280
author Xiang, Ruite
Fernandez-Lopez, Laura
Robles-Martín, Ana
Ferrer, Manuel
Guallar, Victor
author_facet Xiang, Ruite
Fernandez-Lopez, Laura
Robles-Martín, Ana
Ferrer, Manuel
Guallar, Victor
author_sort Xiang, Ruite
collection PubMed
description When bioprospecting for novel industrial enzymes, substrate promiscuity is a desirable property that increases the reusability of the enzyme. Among industrial enzymes, ester hydrolases have great relevance for which the demand has not ceased to increase. However, the search for new substrate promiscuous ester hydrolases is not trivial since the mechanism behind this property is greatly influenced by the active site’s structural and physicochemical characteristics. These characteristics must be computed from the 3D structure, which is rarely available and expensive to measure, hence the need for a method that can predict promiscuity from sequence alone. Here we report such a method called EP-pred, an ensemble binary classifier, that combines three machine learning algorithms: SVM, KNN, and a Linear model. EP-pred has been evaluated against the Lipase Engineering Database together with a hidden Markov approach leading to a final set of ten sequences predicted to encode promiscuous esterases. Experimental results confirmed the validity of our method since all ten proteins were found to exhibit a broad substrate ambiguity.
format Online
Article
Text
id pubmed-9599548
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95995482022-10-27 EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases Xiang, Ruite Fernandez-Lopez, Laura Robles-Martín, Ana Ferrer, Manuel Guallar, Victor Biomolecules Article When bioprospecting for novel industrial enzymes, substrate promiscuity is a desirable property that increases the reusability of the enzyme. Among industrial enzymes, ester hydrolases have great relevance for which the demand has not ceased to increase. However, the search for new substrate promiscuous ester hydrolases is not trivial since the mechanism behind this property is greatly influenced by the active site’s structural and physicochemical characteristics. These characteristics must be computed from the 3D structure, which is rarely available and expensive to measure, hence the need for a method that can predict promiscuity from sequence alone. Here we report such a method called EP-pred, an ensemble binary classifier, that combines three machine learning algorithms: SVM, KNN, and a Linear model. EP-pred has been evaluated against the Lipase Engineering Database together with a hidden Markov approach leading to a final set of ten sequences predicted to encode promiscuous esterases. Experimental results confirmed the validity of our method since all ten proteins were found to exhibit a broad substrate ambiguity. MDPI 2022-10-21 /pmc/articles/PMC9599548/ /pubmed/36291739 http://dx.doi.org/10.3390/biom12101529 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Xiang, Ruite
Fernandez-Lopez, Laura
Robles-Martín, Ana
Ferrer, Manuel
Guallar, Victor
EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
title EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
title_full EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
title_fullStr EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
title_full_unstemmed EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
title_short EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
title_sort ep-pred: a machine learning tool for bioprospecting promiscuous ester hydrolases
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9599548/
https://www.ncbi.nlm.nih.gov/pubmed/36291739
http://dx.doi.org/10.3390/biom12101529
work_keys_str_mv AT xiangruite eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases
AT fernandezlopezlaura eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases
AT roblesmartinana eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases
AT ferrermanuel eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases
AT guallarvictor eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases