Cargando…
A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery
Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their gr...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624395/ https://www.ncbi.nlm.nih.gov/pubmed/36228001 http://dx.doi.org/10.1371/journal.pcbi.1010613 |
_version_ | 1784822226007097344 |
---|---|
author | Rahman, A. S. M. Zisanur Liu, Chengyou Sturm, Hunter Hogan, Andrew M. Davis, Rebecca Hu, Pingzhao Cardona, Silvia T. |
author_facet | Rahman, A. S. M. Zisanur Liu, Chengyou Sturm, Hunter Hogan, Andrew M. Davis, Rebecca Hu, Pingzhao Cardona, Silvia T. |
author_sort | Rahman, A. S. M. Zisanur |
collection | PubMed |
description | Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their growth inhibitory activity (hit rate 0.87%) against the antibiotic-resistant bacterium Burkholderia cenocepacia and described their molecular features with a directed-message passing neural network (D-MPNN). Then, we used the data to train an ML model that achieved a receiver operating characteristic (ROC) score of 0.823 on the test set. Finally, we predicted antibacterial activity in virtual libraries corresponding to 1,614 compounds from the Food and Drug Administration (FDA)-approved list and 224,205 natural products. Hit rates of 26% and 12%, respectively, were obtained when we tested the top-ranked predicted compounds for growth inhibitory activity against B. cenocepacia, which represents at least a 14-fold increase from the previous hit rate. In addition, more than 51% of the predicted antibacterial natural compounds inhibited ESKAPE pathogens showing that predictions expand beyond the organism-specific dataset to a broad range of bacteria. Overall, the developed ML approach can be used for compound prioritization before screening, increasing the typical hit rate of drug discovery. |
format | Online Article Text |
id | pubmed-9624395 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-96243952022-11-02 A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery Rahman, A. S. M. Zisanur Liu, Chengyou Sturm, Hunter Hogan, Andrew M. Davis, Rebecca Hu, Pingzhao Cardona, Silvia T. PLoS Comput Biol Research Article Screening for novel antibacterial compounds in small molecule libraries has a low success rate. We applied machine learning (ML)-based virtual screening for antibacterial activity and evaluated its predictive power by experimental validation. We first binarized 29,537 compounds according to their growth inhibitory activity (hit rate 0.87%) against the antibiotic-resistant bacterium Burkholderia cenocepacia and described their molecular features with a directed-message passing neural network (D-MPNN). Then, we used the data to train an ML model that achieved a receiver operating characteristic (ROC) score of 0.823 on the test set. Finally, we predicted antibacterial activity in virtual libraries corresponding to 1,614 compounds from the Food and Drug Administration (FDA)-approved list and 224,205 natural products. Hit rates of 26% and 12%, respectively, were obtained when we tested the top-ranked predicted compounds for growth inhibitory activity against B. cenocepacia, which represents at least a 14-fold increase from the previous hit rate. In addition, more than 51% of the predicted antibacterial natural compounds inhibited ESKAPE pathogens showing that predictions expand beyond the organism-specific dataset to a broad range of bacteria. Overall, the developed ML approach can be used for compound prioritization before screening, increasing the typical hit rate of drug discovery. Public Library of Science 2022-10-13 /pmc/articles/PMC9624395/ /pubmed/36228001 http://dx.doi.org/10.1371/journal.pcbi.1010613 Text en © 2022 Rahman et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Rahman, A. S. M. Zisanur Liu, Chengyou Sturm, Hunter Hogan, Andrew M. Davis, Rebecca Hu, Pingzhao Cardona, Silvia T. A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
title | A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
title_full | A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
title_fullStr | A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
title_full_unstemmed | A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
title_short | A machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
title_sort | machine learning model trained on a high-throughput antibacterial screen increases the hit rate of drug discovery |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9624395/ https://www.ncbi.nlm.nih.gov/pubmed/36228001 http://dx.doi.org/10.1371/journal.pcbi.1010613 |
work_keys_str_mv | AT rahmanasmzisanur amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT liuchengyou amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT sturmhunter amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT hoganandrewm amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT davisrebecca amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT hupingzhao amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT cardonasilviat amachinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT rahmanasmzisanur machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT liuchengyou machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT sturmhunter machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT hoganandrewm machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT davisrebecca machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT hupingzhao machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery AT cardonasilviat machinelearningmodeltrainedonahighthroughputantibacterialscreenincreasesthehitrateofdrugdiscovery |