Cargando…
EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases
When bioprospecting for novel industrial enzymes, substrate promiscuity is a desirable property that increases the reusability of the enzyme. Among industrial enzymes, ester hydrolases have great relevance for which the demand has not ceased to increase. However, the search for new substrate promisc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9599548/ https://www.ncbi.nlm.nih.gov/pubmed/36291739 http://dx.doi.org/10.3390/biom12101529 |
_version_ | 1784816621484769280 |
---|---|
author | Xiang, Ruite Fernandez-Lopez, Laura Robles-Martín, Ana Ferrer, Manuel Guallar, Victor |
author_facet | Xiang, Ruite Fernandez-Lopez, Laura Robles-Martín, Ana Ferrer, Manuel Guallar, Victor |
author_sort | Xiang, Ruite |
collection | PubMed |
description | When bioprospecting for novel industrial enzymes, substrate promiscuity is a desirable property that increases the reusability of the enzyme. Among industrial enzymes, ester hydrolases have great relevance for which the demand has not ceased to increase. However, the search for new substrate promiscuous ester hydrolases is not trivial since the mechanism behind this property is greatly influenced by the active site’s structural and physicochemical characteristics. These characteristics must be computed from the 3D structure, which is rarely available and expensive to measure, hence the need for a method that can predict promiscuity from sequence alone. Here we report such a method called EP-pred, an ensemble binary classifier, that combines three machine learning algorithms: SVM, KNN, and a Linear model. EP-pred has been evaluated against the Lipase Engineering Database together with a hidden Markov approach leading to a final set of ten sequences predicted to encode promiscuous esterases. Experimental results confirmed the validity of our method since all ten proteins were found to exhibit a broad substrate ambiguity. |
format | Online Article Text |
id | pubmed-9599548 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95995482022-10-27 EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases Xiang, Ruite Fernandez-Lopez, Laura Robles-Martín, Ana Ferrer, Manuel Guallar, Victor Biomolecules Article When bioprospecting for novel industrial enzymes, substrate promiscuity is a desirable property that increases the reusability of the enzyme. Among industrial enzymes, ester hydrolases have great relevance for which the demand has not ceased to increase. However, the search for new substrate promiscuous ester hydrolases is not trivial since the mechanism behind this property is greatly influenced by the active site’s structural and physicochemical characteristics. These characteristics must be computed from the 3D structure, which is rarely available and expensive to measure, hence the need for a method that can predict promiscuity from sequence alone. Here we report such a method called EP-pred, an ensemble binary classifier, that combines three machine learning algorithms: SVM, KNN, and a Linear model. EP-pred has been evaluated against the Lipase Engineering Database together with a hidden Markov approach leading to a final set of ten sequences predicted to encode promiscuous esterases. Experimental results confirmed the validity of our method since all ten proteins were found to exhibit a broad substrate ambiguity. MDPI 2022-10-21 /pmc/articles/PMC9599548/ /pubmed/36291739 http://dx.doi.org/10.3390/biom12101529 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Xiang, Ruite Fernandez-Lopez, Laura Robles-Martín, Ana Ferrer, Manuel Guallar, Victor EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases |
title | EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases |
title_full | EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases |
title_fullStr | EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases |
title_full_unstemmed | EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases |
title_short | EP-Pred: A Machine Learning Tool for Bioprospecting Promiscuous Ester Hydrolases |
title_sort | ep-pred: a machine learning tool for bioprospecting promiscuous ester hydrolases |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9599548/ https://www.ncbi.nlm.nih.gov/pubmed/36291739 http://dx.doi.org/10.3390/biom12101529 |
work_keys_str_mv | AT xiangruite eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases AT fernandezlopezlaura eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases AT roblesmartinana eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases AT ferrermanuel eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases AT guallarvictor eppredamachinelearningtoolforbioprospectingpromiscuousesterhydrolases |