Cargando…

Machine learning enhances prediction of plants as potential sources of antimalarials

Plants are a rich source of bioactive compounds and a number of plant-derived antiplasmodial compounds have been developed into pharmaceutical drugs for the prevention and treatment of malaria, a major public health challenge. However, identifying plants with antiplasmodial potential can be time-con...

Descripción completa

Detalles Bibliográficos
Autores principales: Richard-Bollans, Adam, Aitken, Conal, Antonelli, Alexandre, Bitencourt, Cássia, Goyder, David, Lucas, Eve, Ondo, Ian, Pérez-Escobar, Oscar A., Pironon, Samuel, Richardson, James E., Russell, David, Silvestro, Daniele, Wright, Colin W., Howes, Melanie-Jayne R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10248027/
https://www.ncbi.nlm.nih.gov/pubmed/37304721
http://dx.doi.org/10.3389/fpls.2023.1173328
_version_ 1785055281439309824
author Richard-Bollans, Adam
Aitken, Conal
Antonelli, Alexandre
Bitencourt, Cássia
Goyder, David
Lucas, Eve
Ondo, Ian
Pérez-Escobar, Oscar A.
Pironon, Samuel
Richardson, James E.
Russell, David
Silvestro, Daniele
Wright, Colin W.
Howes, Melanie-Jayne R.
author_facet Richard-Bollans, Adam
Aitken, Conal
Antonelli, Alexandre
Bitencourt, Cássia
Goyder, David
Lucas, Eve
Ondo, Ian
Pérez-Escobar, Oscar A.
Pironon, Samuel
Richardson, James E.
Russell, David
Silvestro, Daniele
Wright, Colin W.
Howes, Melanie-Jayne R.
author_sort Richard-Bollans, Adam
collection PubMed
description Plants are a rich source of bioactive compounds and a number of plant-derived antiplasmodial compounds have been developed into pharmaceutical drugs for the prevention and treatment of malaria, a major public health challenge. However, identifying plants with antiplasmodial potential can be time-consuming and costly. One approach for selecting plants to investigate is based on ethnobotanical knowledge which, though having provided some major successes, is restricted to a relatively small group of plant species. Machine learning, incorporating ethnobotanical and plant trait data, provides a promising approach to improve the identification of antiplasmodial plants and accelerate the search for new plant-derived antiplasmodial compounds. In this paper we present a novel dataset on antiplasmodial activity for three flowering plant families – Apocynaceae, Loganiaceae and Rubiaceae (together comprising c. 21,100 species) – and demonstrate the ability of machine learning algorithms to predict the antiplasmodial potential of plant species. We evaluate the predictive capability of a variety of algorithms – Support Vector Machines, Logistic Regression, Gradient Boosted Trees and Bayesian Neural Networks – and compare these to two ethnobotanical selection approaches – based on usage as an antimalarial and general usage as a medicine. We evaluate the approaches using the given data and when the given samples are reweighted to correct for sampling biases. In both evaluation settings each of the machine learning models have a higher precision than the ethnobotanical approaches. In the bias-corrected scenario, the Support Vector classifier performs best – attaining a mean precision of 0.67 compared to the best performing ethnobotanical approach with a mean precision of 0.46. We also use the bias correction method and the Support Vector classifier to estimate the potential of plants to provide novel antiplasmodial compounds. We estimate that 7677 species in Apocynaceae, Loganiaceae and Rubiaceae warrant further investigation and that at least 1300 active antiplasmodial species are highly unlikely to be investigated by conventional approaches. While traditional and Indigenous knowledge remains vital to our understanding of people-plant relationships and an invaluable source of information, these results indicate a vast and relatively untapped source in the search for new plant-derived antiplasmodial compounds.
format Online
Article
Text
id pubmed-10248027
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102480272023-06-09 Machine learning enhances prediction of plants as potential sources of antimalarials Richard-Bollans, Adam Aitken, Conal Antonelli, Alexandre Bitencourt, Cássia Goyder, David Lucas, Eve Ondo, Ian Pérez-Escobar, Oscar A. Pironon, Samuel Richardson, James E. Russell, David Silvestro, Daniele Wright, Colin W. Howes, Melanie-Jayne R. Front Plant Sci Plant Science Plants are a rich source of bioactive compounds and a number of plant-derived antiplasmodial compounds have been developed into pharmaceutical drugs for the prevention and treatment of malaria, a major public health challenge. However, identifying plants with antiplasmodial potential can be time-consuming and costly. One approach for selecting plants to investigate is based on ethnobotanical knowledge which, though having provided some major successes, is restricted to a relatively small group of plant species. Machine learning, incorporating ethnobotanical and plant trait data, provides a promising approach to improve the identification of antiplasmodial plants and accelerate the search for new plant-derived antiplasmodial compounds. In this paper we present a novel dataset on antiplasmodial activity for three flowering plant families – Apocynaceae, Loganiaceae and Rubiaceae (together comprising c. 21,100 species) – and demonstrate the ability of machine learning algorithms to predict the antiplasmodial potential of plant species. We evaluate the predictive capability of a variety of algorithms – Support Vector Machines, Logistic Regression, Gradient Boosted Trees and Bayesian Neural Networks – and compare these to two ethnobotanical selection approaches – based on usage as an antimalarial and general usage as a medicine. We evaluate the approaches using the given data and when the given samples are reweighted to correct for sampling biases. In both evaluation settings each of the machine learning models have a higher precision than the ethnobotanical approaches. In the bias-corrected scenario, the Support Vector classifier performs best – attaining a mean precision of 0.67 compared to the best performing ethnobotanical approach with a mean precision of 0.46. We also use the bias correction method and the Support Vector classifier to estimate the potential of plants to provide novel antiplasmodial compounds. We estimate that 7677 species in Apocynaceae, Loganiaceae and Rubiaceae warrant further investigation and that at least 1300 active antiplasmodial species are highly unlikely to be investigated by conventional approaches. While traditional and Indigenous knowledge remains vital to our understanding of people-plant relationships and an invaluable source of information, these results indicate a vast and relatively untapped source in the search for new plant-derived antiplasmodial compounds. Frontiers Media S.A. 2023-05-25 /pmc/articles/PMC10248027/ /pubmed/37304721 http://dx.doi.org/10.3389/fpls.2023.1173328 Text en Copyright © 2023 Richard-Bollans, Aitken, Antonelli, Bitencourt, Goyder, Lucas, Ondo, Pérez-Escobar, Pironon, Richardson, Russell, Silvestro, Wright and Howes https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Richard-Bollans, Adam
Aitken, Conal
Antonelli, Alexandre
Bitencourt, Cássia
Goyder, David
Lucas, Eve
Ondo, Ian
Pérez-Escobar, Oscar A.
Pironon, Samuel
Richardson, James E.
Russell, David
Silvestro, Daniele
Wright, Colin W.
Howes, Melanie-Jayne R.
Machine learning enhances prediction of plants as potential sources of antimalarials
title Machine learning enhances prediction of plants as potential sources of antimalarials
title_full Machine learning enhances prediction of plants as potential sources of antimalarials
title_fullStr Machine learning enhances prediction of plants as potential sources of antimalarials
title_full_unstemmed Machine learning enhances prediction of plants as potential sources of antimalarials
title_short Machine learning enhances prediction of plants as potential sources of antimalarials
title_sort machine learning enhances prediction of plants as potential sources of antimalarials
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10248027/
https://www.ncbi.nlm.nih.gov/pubmed/37304721
http://dx.doi.org/10.3389/fpls.2023.1173328
work_keys_str_mv AT richardbollansadam machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT aitkenconal machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT antonellialexandre machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT bitencourtcassia machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT goyderdavid machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT lucaseve machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT ondoian machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT perezescobaroscara machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT pirononsamuel machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT richardsonjamese machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT russelldavid machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT silvestrodaniele machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT wrightcolinw machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials
AT howesmelaniejayner machinelearningenhancespredictionofplantsaspotentialsourcesofantimalarials