Cargando…

Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features

Recently, there has been a growing interest in the development of pharmacological interventions targeting ageing, as well as in the use of machine learning for analysing ageing-related data. In this work, we use machine learning methods to analyse data from DrugAge, a database of chemical compounds...

Descripción completa

Detalles Bibliográficos
Autores principales: Ribeiro, Caio, Farmer, Christopher K., de Magalhães, João Pedro, Freitas, Alex A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10373959/
https://www.ncbi.nlm.nih.gov/pubmed/37450404
http://dx.doi.org/10.18632/aging.204866
_version_ 1785078670355857408
author Ribeiro, Caio
Farmer, Christopher K.
de Magalhães, João Pedro
Freitas, Alex A.
author_facet Ribeiro, Caio
Farmer, Christopher K.
de Magalhães, João Pedro
Freitas, Alex A.
author_sort Ribeiro, Caio
collection PubMed
description Recently, there has been a growing interest in the development of pharmacological interventions targeting ageing, as well as in the use of machine learning for analysing ageing-related data. In this work, we use machine learning methods to analyse data from DrugAge, a database of chemical compounds (including drugs) modulating lifespan in model organisms. To this end, we created four types of datasets for predicting whether or not a compound extends the lifespan of C. elegans (the most frequent model organism in DrugAge), using four different types of predictive biological features, based on: compound-protein interactions, interactions between compounds and proteins encoded by ageing-related genes, and two types of terms annotated for proteins targeted by the compounds, namely Gene Ontology (GO) terms and physiology terms from the WormBase’s Phenotype Ontology. To analyse these datasets, we used a combination of feature selection methods in a data pre-processing phase and the well-established random forest algorithm for learning predictive models from the selected features. In addition, we interpreted the most important features in the two best models in light of the biology of ageing. One noteworthy feature was the GO term “Glutathione metabolic process”, which plays an important role in cellular redox homeostasis and detoxification. We also predicted the most promising novel compounds for extending lifespan from a list of previously unlabelled compounds. These include nitroprusside, which is used as an antihypertensive medication. Overall, our work opens avenues for future work in employing machine learning to predict novel life-extending compounds.
format Online
Article
Text
id pubmed-10373959
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Impact Journals
record_format MEDLINE/PubMed
spelling pubmed-103739592023-07-28 Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features Ribeiro, Caio Farmer, Christopher K. de Magalhães, João Pedro Freitas, Alex A. Aging (Albany NY) Research Paper Recently, there has been a growing interest in the development of pharmacological interventions targeting ageing, as well as in the use of machine learning for analysing ageing-related data. In this work, we use machine learning methods to analyse data from DrugAge, a database of chemical compounds (including drugs) modulating lifespan in model organisms. To this end, we created four types of datasets for predicting whether or not a compound extends the lifespan of C. elegans (the most frequent model organism in DrugAge), using four different types of predictive biological features, based on: compound-protein interactions, interactions between compounds and proteins encoded by ageing-related genes, and two types of terms annotated for proteins targeted by the compounds, namely Gene Ontology (GO) terms and physiology terms from the WormBase’s Phenotype Ontology. To analyse these datasets, we used a combination of feature selection methods in a data pre-processing phase and the well-established random forest algorithm for learning predictive models from the selected features. In addition, we interpreted the most important features in the two best models in light of the biology of ageing. One noteworthy feature was the GO term “Glutathione metabolic process”, which plays an important role in cellular redox homeostasis and detoxification. We also predicted the most promising novel compounds for extending lifespan from a list of previously unlabelled compounds. These include nitroprusside, which is used as an antihypertensive medication. Overall, our work opens avenues for future work in employing machine learning to predict novel life-extending compounds. Impact Journals 2023-07-13 /pmc/articles/PMC10373959/ /pubmed/37450404 http://dx.doi.org/10.18632/aging.204866 Text en Copyright: © 2023 Ribeiro et al. https://creativecommons.org/licenses/by/3.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/3.0/) (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Ribeiro, Caio
Farmer, Christopher K.
de Magalhães, João Pedro
Freitas, Alex A.
Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features
title Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features
title_full Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features
title_fullStr Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features
title_full_unstemmed Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features
title_short Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features
title_sort predicting lifespan-extending chemical compounds for c. elegans with machine learning and biologically interpretable features
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10373959/
https://www.ncbi.nlm.nih.gov/pubmed/37450404
http://dx.doi.org/10.18632/aging.204866
work_keys_str_mv AT ribeirocaio predictinglifespanextendingchemicalcompoundsforceleganswithmachinelearningandbiologicallyinterpretablefeatures
AT farmerchristopherk predictinglifespanextendingchemicalcompoundsforceleganswithmachinelearningandbiologicallyinterpretablefeatures
AT demagalhaesjoaopedro predictinglifespanextendingchemicalcompoundsforceleganswithmachinelearningandbiologicallyinterpretablefeatures
AT freitasalexa predictinglifespanextendingchemicalcompoundsforceleganswithmachinelearningandbiologicallyinterpretablefeatures