Cargando…

Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening

Urease is a metalloenzyme that catalyzes the hydrolysis of urea, and its modulation has an important role in both the agricultural and medical industry. Even though numerous molecules have been tested against ureases of different species, their clinical translation has been limited due to chemical a...

Descripción completa

Detalles Bibliográficos
Autores principales: Aniceto, Natália, Albuquerque, Tânia S., Bonifácio, Vasco D. B., Guedes, Rita C., Martinho, Nuno
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10179503/
https://www.ncbi.nlm.nih.gov/pubmed/37175889
http://dx.doi.org/10.3390/ijms24098180
_version_ 1785041113465225216
author Aniceto, Natália
Albuquerque, Tânia S.
Bonifácio, Vasco D. B.
Guedes, Rita C.
Martinho, Nuno
author_facet Aniceto, Natália
Albuquerque, Tânia S.
Bonifácio, Vasco D. B.
Guedes, Rita C.
Martinho, Nuno
author_sort Aniceto, Natália
collection PubMed
description Urease is a metalloenzyme that catalyzes the hydrolysis of urea, and its modulation has an important role in both the agricultural and medical industry. Even though numerous molecules have been tested against ureases of different species, their clinical translation has been limited due to chemical and metabolic stability as well as side effects. Therefore, screening new compounds against urease would be of interest in part due to rising concerns regarding antibiotic resistance. In this work, we collected and curated a diverse set of 2640 publicly available small-molecule inhibitors of jack bean urease and developed a classifier using a random forest machine learning method with high predictive performance. In addition, the physicochemical features of compounds were paired with molecular docking and protein–ligand fingerprint analysis to gather insight into the current activity landscape. We observed that the docking score could not differentiate active from inactive compounds within each chemical family, but scores were correlated with compound activity when all compounds were considered. Additionally, a decision tree model was built based on 2D and 3D Morgan fingerprints to mine patterns of the known active-class compounds. The final machine learning model showed good prediction performance against the test set (81% and 77% precision for active and inactive compounds, respectively). Finally, this model was employed, as a proof-of-concept, on an in-house library to predict new hits that were then tested against urease and found to be active. This is, to date, the largest, most diverse dataset of compounds used to develop predictive in silico models. Overall, the results highlight the usefulness of using machine learning classifiers and molecular docking to predict novel urease inhibitors.
format Online
Article
Text
id pubmed-10179503
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101795032023-05-13 Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening Aniceto, Natália Albuquerque, Tânia S. Bonifácio, Vasco D. B. Guedes, Rita C. Martinho, Nuno Int J Mol Sci Article Urease is a metalloenzyme that catalyzes the hydrolysis of urea, and its modulation has an important role in both the agricultural and medical industry. Even though numerous molecules have been tested against ureases of different species, their clinical translation has been limited due to chemical and metabolic stability as well as side effects. Therefore, screening new compounds against urease would be of interest in part due to rising concerns regarding antibiotic resistance. In this work, we collected and curated a diverse set of 2640 publicly available small-molecule inhibitors of jack bean urease and developed a classifier using a random forest machine learning method with high predictive performance. In addition, the physicochemical features of compounds were paired with molecular docking and protein–ligand fingerprint analysis to gather insight into the current activity landscape. We observed that the docking score could not differentiate active from inactive compounds within each chemical family, but scores were correlated with compound activity when all compounds were considered. Additionally, a decision tree model was built based on 2D and 3D Morgan fingerprints to mine patterns of the known active-class compounds. The final machine learning model showed good prediction performance against the test set (81% and 77% precision for active and inactive compounds, respectively). Finally, this model was employed, as a proof-of-concept, on an in-house library to predict new hits that were then tested against urease and found to be active. This is, to date, the largest, most diverse dataset of compounds used to develop predictive in silico models. Overall, the results highlight the usefulness of using machine learning classifiers and molecular docking to predict novel urease inhibitors. MDPI 2023-05-03 /pmc/articles/PMC10179503/ /pubmed/37175889 http://dx.doi.org/10.3390/ijms24098180 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Aniceto, Natália
Albuquerque, Tânia S.
Bonifácio, Vasco D. B.
Guedes, Rita C.
Martinho, Nuno
Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening
title Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening
title_full Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening
title_fullStr Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening
title_full_unstemmed Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening
title_short Using Machine Learning and Molecular Docking to Leverage Urease Inhibition Data for Virtual Screening
title_sort using machine learning and molecular docking to leverage urease inhibition data for virtual screening
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10179503/
https://www.ncbi.nlm.nih.gov/pubmed/37175889
http://dx.doi.org/10.3390/ijms24098180
work_keys_str_mv AT anicetonatalia usingmachinelearningandmoleculardockingtoleverageureaseinhibitiondataforvirtualscreening
AT albuquerquetanias usingmachinelearningandmoleculardockingtoleverageureaseinhibitiondataforvirtualscreening
AT bonifaciovascodb usingmachinelearningandmoleculardockingtoleverageureaseinhibitiondataforvirtualscreening
AT guedesritac usingmachinelearningandmoleculardockingtoleverageureaseinhibitiondataforvirtualscreening
AT martinhonuno usingmachinelearningandmoleculardockingtoleverageureaseinhibitiondataforvirtualscreening