Cargando…
Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles
MOTIVATION: Chemical–genetic interaction profiling is a genetic approach that quantifies the susceptibility of a set of mutants depleted in specific gene product(s) to a set of chemical compounds. With the recent advances in artificial intelligence, chemical–genetic interaction profiles (CGIPs) can...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8917716/ https://www.ncbi.nlm.nih.gov/pubmed/35279211 http://dx.doi.org/10.1186/s13321-022-00596-6 |
_version_ | 1784668607847858176 |
---|---|
author | Liu, Chengyou Hogan, Andrew M. Sturm, Hunter Khan, Mohd Wasif Islam, Md. Mohaiminul Rahman, A. S. M. Zisanur Davis, Rebecca Cardona, Silvia T. Hu, Pingzhao |
author_facet | Liu, Chengyou Hogan, Andrew M. Sturm, Hunter Khan, Mohd Wasif Islam, Md. Mohaiminul Rahman, A. S. M. Zisanur Davis, Rebecca Cardona, Silvia T. Hu, Pingzhao |
author_sort | Liu, Chengyou |
collection | PubMed |
description | MOTIVATION: Chemical–genetic interaction profiling is a genetic approach that quantifies the susceptibility of a set of mutants depleted in specific gene product(s) to a set of chemical compounds. With the recent advances in artificial intelligence, chemical–genetic interaction profiles (CGIPs) can be leveraged to predict mechanism of action of compounds. This can be achieved by using machine learning, where the data from a CGIP is fed into the machine learning platform along with the chemical descriptors to develop a chemogenetically trained model. As small molecules can be considered non-structural data, graph convolutional neural networks, which can learn from the chemical structures directly, can be used to successfully predict molecular properties. Clustering analysis, on the other hand, is a critical approach to get insights into the underlying biological relationships between the gene products in the high-dimensional chemical-genetic data. METHODS AND RESULTS: In this study, we proposed a comprehensive framework based on the large-scale chemical-genetics dataset built in Mycobacterium tuberculosis for predicting CGIPs using graph-based deep learning models. Our approach is structured into three parts. First, by matching M. tuberculosis genes with homologous genes in Escherichia coli (E. coli) according to their gene products, we grouped the genes into clusters with distinct biological functions. Second, we employed a directed message passing neural network to predict growth inhibition against M. tuberculosis gene clusters using a collection of 50,000 chemicals with the profile. We compared the performance of different baseline models and implemented multi-label tasks in binary classification frameworks. Lastly, we applied the trained model to an externally curated drug set that had experimental results against M. tuberculosis genes to examine the effectiveness of our method. Overall, we demonstrate that our approach effectively created M. tuberculosis gene clusters, and the trained classifier is able to predict activity against essential M. tuberculosis targets with high accuracy. CONCLUSION: This work provides an analytical framework for modeling large-scale chemical-genetic datasets for predicting CGIPs and generating hypothesis about mechanism of action of novel drugs. In addition, this work highlights the importance of graph-based deep neural networks in drug discovery. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00596-6. |
format | Online Article Text |
id | pubmed-8917716 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-89177162022-03-17 Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles Liu, Chengyou Hogan, Andrew M. Sturm, Hunter Khan, Mohd Wasif Islam, Md. Mohaiminul Rahman, A. S. M. Zisanur Davis, Rebecca Cardona, Silvia T. Hu, Pingzhao J Cheminform Research Article MOTIVATION: Chemical–genetic interaction profiling is a genetic approach that quantifies the susceptibility of a set of mutants depleted in specific gene product(s) to a set of chemical compounds. With the recent advances in artificial intelligence, chemical–genetic interaction profiles (CGIPs) can be leveraged to predict mechanism of action of compounds. This can be achieved by using machine learning, where the data from a CGIP is fed into the machine learning platform along with the chemical descriptors to develop a chemogenetically trained model. As small molecules can be considered non-structural data, graph convolutional neural networks, which can learn from the chemical structures directly, can be used to successfully predict molecular properties. Clustering analysis, on the other hand, is a critical approach to get insights into the underlying biological relationships between the gene products in the high-dimensional chemical-genetic data. METHODS AND RESULTS: In this study, we proposed a comprehensive framework based on the large-scale chemical-genetics dataset built in Mycobacterium tuberculosis for predicting CGIPs using graph-based deep learning models. Our approach is structured into three parts. First, by matching M. tuberculosis genes with homologous genes in Escherichia coli (E. coli) according to their gene products, we grouped the genes into clusters with distinct biological functions. Second, we employed a directed message passing neural network to predict growth inhibition against M. tuberculosis gene clusters using a collection of 50,000 chemicals with the profile. We compared the performance of different baseline models and implemented multi-label tasks in binary classification frameworks. Lastly, we applied the trained model to an externally curated drug set that had experimental results against M. tuberculosis genes to examine the effectiveness of our method. Overall, we demonstrate that our approach effectively created M. tuberculosis gene clusters, and the trained classifier is able to predict activity against essential M. tuberculosis targets with high accuracy. CONCLUSION: This work provides an analytical framework for modeling large-scale chemical-genetic datasets for predicting CGIPs and generating hypothesis about mechanism of action of novel drugs. In addition, this work highlights the importance of graph-based deep neural networks in drug discovery. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-022-00596-6. Springer International Publishing 2022-03-12 /pmc/articles/PMC8917716/ /pubmed/35279211 http://dx.doi.org/10.1186/s13321-022-00596-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Liu, Chengyou Hogan, Andrew M. Sturm, Hunter Khan, Mohd Wasif Islam, Md. Mohaiminul Rahman, A. S. M. Zisanur Davis, Rebecca Cardona, Silvia T. Hu, Pingzhao Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
title | Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
title_full | Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
title_fullStr | Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
title_full_unstemmed | Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
title_short | Deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
title_sort | deep learning-driven prediction of drug mechanism of action from large-scale chemical-genetic interaction profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8917716/ https://www.ncbi.nlm.nih.gov/pubmed/35279211 http://dx.doi.org/10.1186/s13321-022-00596-6 |
work_keys_str_mv | AT liuchengyou deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT hoganandrewm deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT sturmhunter deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT khanmohdwasif deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT islammdmohaiminul deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT rahmanasmzisanur deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT davisrebecca deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT cardonasilviat deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles AT hupingzhao deeplearningdrivenpredictionofdrugmechanismofactionfromlargescalechemicalgeneticinteractionprofiles |