Cargando…

A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery

Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their gr...

Descripción completa

Detalles Bibliográficos
Autores principales: Rahman, A. S. M. Zisanur, Liu, Chengyou, Sturm, Hunter, Hogan, Andrew M., Davis, Rebecca, Hu, Pingzhao, Cardona, Silvia T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624395/
https://www.ncbi.nlm.nih.gov/pubmed/36228001
http://dx.doi.org/10.1371/journal.pcbi.1010613
_version_ 1784822226007097344
author Rahman, A. S. M. Zisanur
Liu, Chengyou
Sturm, Hunter
Hogan, Andrew M.
Davis, Rebecca
Hu, Pingzhao
Cardona, Silvia T.
author_facet Rahman, A. S. M. Zisanur
Liu, Chengyou
Sturm, Hunter
Hogan, Andrew M.
Davis, Rebecca
Hu, Pingzhao
Cardona, Silvia T.
author_sort Rahman, A. S. M. Zisanur
collection PubMed
description Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their growth inhibitory activity (hit rate 0.87%) against the antibiotic-resistant bacterium Burkholderia cenocepacia and described their molecular features with a directed-message passing neural network (D-MPNN). Then, we used the data to train an ML model that achieved a receiver operating characteristic (ROC) score of 0.823 on the test set. Finally, we predicted antibacterial activity in virtual libraries corresponding to 1,614 compounds from the Food and Drug Administration (FDA)-approved list and 224,205 natural products. Hit rates of 26% and 12%, respectively, were obtained when we tested the top-ranked predicted compounds for growth inhibitory activity against B. cenocepacia, which represents at least a 14-fold increase from the previous hit rate. In addition, more than 51% of the predicted antibacterial natural compounds inhibited ESKAPE pathogens showing that predictions expand beyond the organism-specific dataset to a broad range of bacteria. Overall, the developed ML approach can be used for compound prioritization before screening, increasing the typical hit rate of drug discovery.
format Online
Article
Text
id pubmed-9624395
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-96243952022-11-02 A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery Rahman, A. S. M. Zisanur Liu, Chengyou Sturm, Hunter Hogan, Andrew M. Davis, Rebecca Hu, Pingzhao Cardona, Silvia T. PLoS Comput Biol Research Article Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their growth inhibitory activity (hit rate 0.87%) against the antibiotic-resistant bacterium Burkholderia cenocepacia and described their molecular features with a directed-message passing neural network (D-MPNN). Then, we used the data to train an ML model that achieved a receiver operating characteristic (ROC) score of 0.823 on the test set. Finally, we predicted antibacterial activity in virtual libraries corresponding to 1,614 compounds from the Food and Drug Administration (FDA)-approved list and 224,205 natural products. Hit rates of 26% and 12%, respectively, were obtained when we tested the top-ranked predicted compounds for growth inhibitory activity against B. cenocepacia, which represents at least a 14-fold increase from the previous hit rate. In addition, more than 51% of the predicted antibacterial natural compounds inhibited ESKAPE pathogens showing that predictions expand beyond the organism-specific dataset to a broad range of bacteria. Overall, the developed ML approach can be used for compound prioritization before screening, increasing the typical hit rate of drug discovery. Public Library of Science 2022-10-13 /pmc/articles/PMC9624395/ /pubmed/36228001 http://dx.doi.org/10.1371/journal.pcbi.1010613 Text en © 2022 Rahman et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Rahman, A. S. M. Zisanur
Liu, Chengyou
Sturm, Hunter
Hogan, Andrew M.
Davis, Rebecca
Hu, Pingzhao
Cardona, Silvia T.
A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
title A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
title_full A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
title_fullStr A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
title_full_unstemmed A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
title_short A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
title_sort machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624395/
https://www.ncbi.nlm.nih.gov/pubmed/36228001
http://dx.doi.org/10.1371/journal.pcbi.1010613
work_keys_str_mv AT rahmanasmzisanur amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT liuchengyou amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT sturmhunter amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT hoganandrewm amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT davisrebecca amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT hupingzhao amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT cardonasilviat amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT rahmanasmzisanur machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT liuchengyou machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT sturmhunter machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT hoganandrewm machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT davisrebecca machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT hupingzhao machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery
AT cardonasilviat machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery