Cargando…

Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs

Natural compounds constitute a rich resource of potential small molecule therapeutics. While experimental access to this resource is limited due to its vast diversity and difficulties in systematic purification, computational assessment of structural similarity with known therapeutic molecules offer...

Descripción completa

Detalles Bibliográficos
Autores principales: Periwal, Vinita, Bassler, Stefan, Andrejev, Sergej, Gabrielli, Natalia, Patil, Kaustubh Raosaheb, Typas, Athanasios, Patil, Kiran Raosaheb
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071136/
https://www.ncbi.nlm.nih.gov/pubmed/35468126
http://dx.doi.org/10.1371/journal.pcbi.1010029
_version_ 1784700785651613696
author Periwal, Vinita
Bassler, Stefan
Andrejev, Sergej
Gabrielli, Natalia
Patil, Kaustubh Raosaheb
Typas, Athanasios
Patil, Kiran Raosaheb
author_facet Periwal, Vinita
Bassler, Stefan
Andrejev, Sergej
Gabrielli, Natalia
Patil, Kaustubh Raosaheb
Typas, Athanasios
Patil, Kiran Raosaheb
author_sort Periwal, Vinita
collection PubMed
description Natural compounds constitute a rich resource of potential small molecule therapeutics. While experimental access to this resource is limited due to its vast diversity and difficulties in systematic purification, computational assessment of structural similarity with known therapeutic molecules offers a scalable approach. Here, we assessed functional similarity between natural compounds and approved drugs by combining multiple chemical similarity metrics and physicochemical properties using a machine-learning approach. We computed pairwise similarities between 1410 drugs for training classification models and used the drugs shared protein targets as class labels. The best performing models were random forest which gave an average area under the ROC of 0.9, Matthews correlation coefficient of 0.35, and F1 score of 0.33, suggesting that it captured the structure-activity relation well. The models were then used to predict protein targets of circa 11k natural compounds by comparing them with the drugs. This revealed therapeutic potential of several natural compounds, including those with support from previously published sources as well as those hitherto unexplored. We experimentally validated one of the predicted pair’s activities, viz., Cox-1 inhibition by 5-methoxysalicylic acid, a molecule commonly found in tea, herbs and spices. In contrast, another natural compound, 4-isopropylbenzoic acid, with the highest similarity score when considering most weighted similarity metric but not picked by our models, did not inhibit Cox-1. Our results demonstrate the utility of a machine-learning approach combining multiple chemical features for uncovering protein binding potential of natural compounds.
format Online
Article
Text
id pubmed-9071136
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-90711362022-05-06 Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs Periwal, Vinita Bassler, Stefan Andrejev, Sergej Gabrielli, Natalia Patil, Kaustubh Raosaheb Typas, Athanasios Patil, Kiran Raosaheb PLoS Comput Biol Research Article Natural compounds constitute a rich resource of potential small molecule therapeutics. While experimental access to this resource is limited due to its vast diversity and difficulties in systematic purification, computational assessment of structural similarity with known therapeutic molecules offers a scalable approach. Here, we assessed functional similarity between natural compounds and approved drugs by combining multiple chemical similarity metrics and physicochemical properties using a machine-learning approach. We computed pairwise similarities between 1410 drugs for training classification models and used the drugs shared protein targets as class labels. The best performing models were random forest which gave an average area under the ROC of 0.9, Matthews correlation coefficient of 0.35, and F1 score of 0.33, suggesting that it captured the structure-activity relation well. The models were then used to predict protein targets of circa 11k natural compounds by comparing them with the drugs. This revealed therapeutic potential of several natural compounds, including those with support from previously published sources as well as those hitherto unexplored. We experimentally validated one of the predicted pair’s activities, viz., Cox-1 inhibition by 5-methoxysalicylic acid, a molecule commonly found in tea, herbs and spices. In contrast, another natural compound, 4-isopropylbenzoic acid, with the highest similarity score when considering most weighted similarity metric but not picked by our models, did not inhibit Cox-1. Our results demonstrate the utility of a machine-learning approach combining multiple chemical features for uncovering protein binding potential of natural compounds. Public Library of Science 2022-04-25 /pmc/articles/PMC9071136/ /pubmed/35468126 http://dx.doi.org/10.1371/journal.pcbi.1010029 Text en © 2022 Periwal et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Periwal, Vinita
Bassler, Stefan
Andrejev, Sergej
Gabrielli, Natalia
Patil, Kaustubh Raosaheb
Typas, Athanasios
Patil, Kiran Raosaheb
Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
title Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
title_full Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
title_fullStr Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
title_full_unstemmed Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
title_short Bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
title_sort bioactivity assessment of natural compounds using machine learning models trained on target similarity between drugs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071136/
https://www.ncbi.nlm.nih.gov/pubmed/35468126
http://dx.doi.org/10.1371/journal.pcbi.1010029
work_keys_str_mv AT periwalvinita bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs
AT basslerstefan bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs
AT andrejevsergej bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs
AT gabriellinatalia bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs
AT patilkaustubhraosaheb bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs
AT typasathanasios bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs
AT patilkiranraosaheb bioactivityassessmentofnaturalcompoundsusingmachinelearningmodelstrainedontargetsimilaritybetweendrugs