Cargando…
Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
Drug discovery (DD) research is a complex field with a high attrition rate. Machine learning (ML) approaches combined to chemoinformatics are of valuable input to this field. We, herein, focused on implementing multiple ML algorithms that shall learn from different molecular fingerprints (FPs) of 65...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036323/ https://www.ncbi.nlm.nih.gov/pubmed/35478992 http://dx.doi.org/10.1177/11779322221090349 |
_version_ | 1784693498911391744 |
---|---|
author | Harigua-Souiai, Emna Oualha, Rafeh Souiai, Oussama Abdeljaoued-Tej, Ines Guizani, Ikram |
author_facet | Harigua-Souiai, Emna Oualha, Rafeh Souiai, Oussama Abdeljaoued-Tej, Ines Guizani, Ikram |
author_sort | Harigua-Souiai, Emna |
collection | PubMed |
description | Drug discovery (DD) research is a complex field with a high attrition rate. Machine learning (ML) approaches combined to chemoinformatics are of valuable input to this field. We, herein, focused on implementing multiple ML algorithms that shall learn from different molecular fingerprints (FPs) of 65 057 molecules that have been identified as active or inactive against Leishmania major promastigotes. We sought to build a classifier able to predict whether a given molecule has the potential of being anti-leishmanial or not. Using the RDkit library, we calculated 5 molecular FPs of the molecules. Then, we implemented 4 ML algorithms that we trained and tested for their ability to classify the molecules into active/inactive classes based on their chemical structure, encoded by the molecular FPs. Best performers were random forest (RF) and support vector machine (SVM), while atom-pair and topology torsion FPs were the best embedding functions. Both models were further assessed on different stratification levels of the dataset and showed stable performances. At last, we used them to predict the potential of molecules within the Food and Drug Administration (FDA)-approved drugs collection to present anti-Leishmania effects. We ranked these drugs according to their anti-Leishmanial probability and obtained in total seven anti-Leishmania agents, previously described in the literature, within the top 10 of each model. This validates the robustness of the approach, the algorithms, and FPs choices as well as the importance of the dataset size and content. We further engaged these molecules into reverse docking experiments on 3D crystal structures of seven well-studied Leishmania drug targets and could predict the molecular targets for 4 drugs. The results bring novel insights into anti-Leishmania compounds. |
format | Online Article Text |
id | pubmed-9036323 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-90363232022-04-26 Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study Harigua-Souiai, Emna Oualha, Rafeh Souiai, Oussama Abdeljaoued-Tej, Ines Guizani, Ikram Bioinform Biol Insights Original Research Drug discovery (DD) research is a complex field with a high attrition rate. Machine learning (ML) approaches combined to chemoinformatics are of valuable input to this field. We, herein, focused on implementing multiple ML algorithms that shall learn from different molecular fingerprints (FPs) of 65 057 molecules that have been identified as active or inactive against Leishmania major promastigotes. We sought to build a classifier able to predict whether a given molecule has the potential of being anti-leishmanial or not. Using the RDkit library, we calculated 5 molecular FPs of the molecules. Then, we implemented 4 ML algorithms that we trained and tested for their ability to classify the molecules into active/inactive classes based on their chemical structure, encoded by the molecular FPs. Best performers were random forest (RF) and support vector machine (SVM), while atom-pair and topology torsion FPs were the best embedding functions. Both models were further assessed on different stratification levels of the dataset and showed stable performances. At last, we used them to predict the potential of molecules within the Food and Drug Administration (FDA)-approved drugs collection to present anti-Leishmania effects. We ranked these drugs according to their anti-Leishmanial probability and obtained in total seven anti-Leishmania agents, previously described in the literature, within the top 10 of each model. This validates the robustness of the approach, the algorithms, and FPs choices as well as the importance of the dataset size and content. We further engaged these molecules into reverse docking experiments on 3D crystal structures of seven well-studied Leishmania drug targets and could predict the molecular targets for 4 drugs. The results bring novel insights into anti-Leishmania compounds. SAGE Publications 2022-04-22 /pmc/articles/PMC9036323/ /pubmed/35478992 http://dx.doi.org/10.1177/11779322221090349 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Harigua-Souiai, Emna Oualha, Rafeh Souiai, Oussama Abdeljaoued-Tej, Ines Guizani, Ikram Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study |
title | Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study |
title_full | Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study |
title_fullStr | Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study |
title_full_unstemmed | Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study |
title_short | Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study |
title_sort | applied machine learning toward drug discovery enhancement: leishmaniases as a case study |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036323/ https://www.ncbi.nlm.nih.gov/pubmed/35478992 http://dx.doi.org/10.1177/11779322221090349 |
work_keys_str_mv | AT hariguasouiaiemna appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy AT oualharafeh appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy AT souiaioussama appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy AT abdeljaouedtejines appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy AT guizaniikram appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy |