Cargando…

Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study

Drug discovery (DD) research is a complex field with a high attrition rate. Machine learning (ML) approaches combined to chemoinformatics are of valuable input to this field. We, herein, focused on implementing multiple ML algorithms that shall learn from different molecular fingerprints (FPs) of 65...

Descripción completa

Detalles Bibliográficos
Autores principales: Harigua-Souiai, Emna, Oualha, Rafeh, Souiai, Oussama, Abdeljaoued-Tej, Ines, Guizani, Ikram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036323/
https://www.ncbi.nlm.nih.gov/pubmed/35478992
http://dx.doi.org/10.1177/11779322221090349
_version_ 1784693498911391744
author Harigua-Souiai, Emna
Oualha, Rafeh
Souiai, Oussama
Abdeljaoued-Tej, Ines
Guizani, Ikram
author_facet Harigua-Souiai, Emna
Oualha, Rafeh
Souiai, Oussama
Abdeljaoued-Tej, Ines
Guizani, Ikram
author_sort Harigua-Souiai, Emna
collection PubMed
description Drug discovery (DD) research is a complex field with a high attrition rate. Machine learning (ML) approaches combined to chemoinformatics are of valuable input to this field. We, herein, focused on implementing multiple ML algorithms that shall learn from different molecular fingerprints (FPs) of 65 057 molecules that have been identified as active or inactive against Leishmania major promastigotes. We sought to build a classifier able to predict whether a given molecule has the potential of being anti-leishmanial or not. Using the RDkit library, we calculated 5 molecular FPs of the molecules. Then, we implemented 4 ML algorithms that we trained and tested for their ability to classify the molecules into active/inactive classes based on their chemical structure, encoded by the molecular FPs. Best performers were random forest (RF) and support vector machine (SVM), while atom-pair and topology torsion FPs were the best embedding functions. Both models were further assessed on different stratification levels of the dataset and showed stable performances. At last, we used them to predict the potential of molecules within the Food and Drug Administration (FDA)-approved drugs collection to present anti-Leishmania effects. We ranked these drugs according to their anti-Leishmanial probability and obtained in total seven anti-Leishmania agents, previously described in the literature, within the top 10 of each model. This validates the robustness of the approach, the algorithms, and FPs choices as well as the importance of the dataset size and content. We further engaged these molecules into reverse docking experiments on 3D crystal structures of seven well-studied Leishmania drug targets and could predict the molecular targets for 4 drugs. The results bring novel insights into anti-Leishmania compounds.
format Online
Article
Text
id pubmed-9036323
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-90363232022-04-26 Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study Harigua-Souiai, Emna Oualha, Rafeh Souiai, Oussama Abdeljaoued-Tej, Ines Guizani, Ikram Bioinform Biol Insights Original Research Drug discovery (DD) research is a complex field with a high attrition rate. Machine learning (ML) approaches combined to chemoinformatics are of valuable input to this field. We, herein, focused on implementing multiple ML algorithms that shall learn from different molecular fingerprints (FPs) of 65 057 molecules that have been identified as active or inactive against Leishmania major promastigotes. We sought to build a classifier able to predict whether a given molecule has the potential of being anti-leishmanial or not. Using the RDkit library, we calculated 5 molecular FPs of the molecules. Then, we implemented 4 ML algorithms that we trained and tested for their ability to classify the molecules into active/inactive classes based on their chemical structure, encoded by the molecular FPs. Best performers were random forest (RF) and support vector machine (SVM), while atom-pair and topology torsion FPs were the best embedding functions. Both models were further assessed on different stratification levels of the dataset and showed stable performances. At last, we used them to predict the potential of molecules within the Food and Drug Administration (FDA)-approved drugs collection to present anti-Leishmania effects. We ranked these drugs according to their anti-Leishmanial probability and obtained in total seven anti-Leishmania agents, previously described in the literature, within the top 10 of each model. This validates the robustness of the approach, the algorithms, and FPs choices as well as the importance of the dataset size and content. We further engaged these molecules into reverse docking experiments on 3D crystal structures of seven well-studied Leishmania drug targets and could predict the molecular targets for 4 drugs. The results bring novel insights into anti-Leishmania compounds. SAGE Publications 2022-04-22 /pmc/articles/PMC9036323/ /pubmed/35478992 http://dx.doi.org/10.1177/11779322221090349 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Harigua-Souiai, Emna
Oualha, Rafeh
Souiai, Oussama
Abdeljaoued-Tej, Ines
Guizani, Ikram
Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
title Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
title_full Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
title_fullStr Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
title_full_unstemmed Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
title_short Applied Machine Learning Toward Drug Discovery Enhancement: Leishmaniases as a Case Study
title_sort applied machine learning toward drug discovery enhancement: leishmaniases as a case study
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9036323/
https://www.ncbi.nlm.nih.gov/pubmed/35478992
http://dx.doi.org/10.1177/11779322221090349
work_keys_str_mv AT hariguasouiaiemna appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy
AT oualharafeh appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy
AT souiaioussama appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy
AT abdeljaouedtejines appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy
AT guizaniikram appliedmachinelearningtowarddrugdiscoveryenhancementleishmaniasesasacasestudy