Cargando…

An ensemble learning approach for modeling the systems biology of drug-induced injury

BACKGROUND: Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the patho...

Descripción completa

Detalles Bibliográficos
Autores principales: Aguirre-Plans, Joaquim, Piñero, Janet, Souza, Terezinha, Callegaro, Giulia, Kunnen, Steven J., Sanz, Ferran, Fernandez-Fuentes, Narcis, Furlong, Laura I., Guney, Emre, Oliva, Baldo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805064/
https://www.ncbi.nlm.nih.gov/pubmed/33435983
http://dx.doi.org/10.1186/s13062-020-00288-x
_version_ 1783636242696503296
author Aguirre-Plans, Joaquim
Piñero, Janet
Souza, Terezinha
Callegaro, Giulia
Kunnen, Steven J.
Sanz, Ferran
Fernandez-Fuentes, Narcis
Furlong, Laura I.
Guney, Emre
Oliva, Baldo
author_facet Aguirre-Plans, Joaquim
Piñero, Janet
Souza, Terezinha
Callegaro, Giulia
Kunnen, Steven J.
Sanz, Ferran
Fernandez-Fuentes, Narcis
Furlong, Laura I.
Guney, Emre
Oliva, Baldo
author_sort Aguirre-Plans, Joaquim
collection PubMed
description BACKGROUND: Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the pathophysiology and mechanisms of DILI are poorly understood. In the present study, we developed an ensemble learning approach based on different features (CMap gene expression, chemical structures, drug targets) to predict drugs that might cause DILI and gain a better understanding of the mechanisms linked to the adverse reaction. RESULTS: We searched for gene signatures in CMap gene expression data by using two approaches: phenotype-gene associations data from DisGeNET, and a non-parametric test comparing gene expression of DILI-Concern and No-DILI-Concern drugs (as per DILIrank definitions). The average accuracy of the classifiers in both approaches was 69%. We used chemical structures as features, obtaining an accuracy of 65%. The combination of both types of features produced an accuracy around 63%, but improved the independent hold-out test up to 67%. The use of drug-target associations as feature obtained the best accuracy (70%) in the independent hold-out test. CONCLUSIONS: When using CMap gene expression data, searching for a specific gene signature among the landmark genes improves the quality of the classifiers, but it is still limited by the intrinsic noise of the dataset. When using chemical structures as a feature, the structural diversity of the known DILI-causing drugs hampers the prediction, which is a similar problem as for the use of gene expression information. The combination of both features did not improve the quality of the classifiers but increased the robustness as shown on independent hold-out tests. The use of drug-target associations as feature improved the prediction, specially the specificity, and the results were comparable to previous research studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13062-020-00288-x.
format Online
Article
Text
id pubmed-7805064
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78050642021-01-14 An ensemble learning approach for modeling the systems biology of drug-induced injury Aguirre-Plans, Joaquim Piñero, Janet Souza, Terezinha Callegaro, Giulia Kunnen, Steven J. Sanz, Ferran Fernandez-Fuentes, Narcis Furlong, Laura I. Guney, Emre Oliva, Baldo Biol Direct Research BACKGROUND: Drug-induced liver injury (DILI) is an adverse reaction caused by the intake of drugs of common use that produces liver damage. The impact of DILI is estimated to affect around 20 in 100,000 inhabitants worldwide each year. Despite being one of the main causes of liver failure, the pathophysiology and mechanisms of DILI are poorly understood. In the present study, we developed an ensemble learning approach based on different features (CMap gene expression, chemical structures, drug targets) to predict drugs that might cause DILI and gain a better understanding of the mechanisms linked to the adverse reaction. RESULTS: We searched for gene signatures in CMap gene expression data by using two approaches: phenotype-gene associations data from DisGeNET, and a non-parametric test comparing gene expression of DILI-Concern and No-DILI-Concern drugs (as per DILIrank definitions). The average accuracy of the classifiers in both approaches was 69%. We used chemical structures as features, obtaining an accuracy of 65%. The combination of both types of features produced an accuracy around 63%, but improved the independent hold-out test up to 67%. The use of drug-target associations as feature obtained the best accuracy (70%) in the independent hold-out test. CONCLUSIONS: When using CMap gene expression data, searching for a specific gene signature among the landmark genes improves the quality of the classifiers, but it is still limited by the intrinsic noise of the dataset. When using chemical structures as a feature, the structural diversity of the known DILI-causing drugs hampers the prediction, which is a similar problem as for the use of gene expression information. The combination of both features did not improve the quality of the classifiers but increased the robustness as shown on independent hold-out tests. The use of drug-target associations as feature improved the prediction, specially the specificity, and the results were comparable to previous research studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13062-020-00288-x. BioMed Central 2021-01-12 /pmc/articles/PMC7805064/ /pubmed/33435983 http://dx.doi.org/10.1186/s13062-020-00288-x Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Aguirre-Plans, Joaquim
Piñero, Janet
Souza, Terezinha
Callegaro, Giulia
Kunnen, Steven J.
Sanz, Ferran
Fernandez-Fuentes, Narcis
Furlong, Laura I.
Guney, Emre
Oliva, Baldo
An ensemble learning approach for modeling the systems biology of drug-induced injury
title An ensemble learning approach for modeling the systems biology of drug-induced injury
title_full An ensemble learning approach for modeling the systems biology of drug-induced injury
title_fullStr An ensemble learning approach for modeling the systems biology of drug-induced injury
title_full_unstemmed An ensemble learning approach for modeling the systems biology of drug-induced injury
title_short An ensemble learning approach for modeling the systems biology of drug-induced injury
title_sort ensemble learning approach for modeling the systems biology of drug-induced injury
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805064/
https://www.ncbi.nlm.nih.gov/pubmed/33435983
http://dx.doi.org/10.1186/s13062-020-00288-x
work_keys_str_mv AT aguirreplansjoaquim anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT pinerojanet anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT souzaterezinha anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT callegarogiulia anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT kunnenstevenj anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT sanzferran anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT fernandezfuentesnarcis anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT furlonglaurai anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT guneyemre anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT olivabaldo anensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT aguirreplansjoaquim ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT pinerojanet ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT souzaterezinha ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT callegarogiulia ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT kunnenstevenj ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT sanzferran ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT fernandezfuentesnarcis ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT furlonglaurai ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT guneyemre ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury
AT olivabaldo ensemblelearningapproachformodelingthesystemsbiologyofdruginducedinjury