Cargando…

Predictability of drug-induced liver injury by machine learning

BACKGROUND: Drug-induced liver injury (DILI) is a major concern in drug development, as hepatotoxicity may not be apparent at early stages but can lead to life threatening consequences. The ability to predict DILI from in vitro data would be a crucial advantage. In 2018, the Critical Assessment Mass...

Descripción completa

Detalles Bibliográficos
Autores principales: Chierici, Marco, Francescatto, Margherita, Bussola, Nicole, Jurman, Giuseppe, Furlanello, Cesare
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7020573/
https://www.ncbi.nlm.nih.gov/pubmed/32054490
http://dx.doi.org/10.1186/s13062-020-0259-4
_version_ 1783497776109191168
author Chierici, Marco
Francescatto, Margherita
Bussola, Nicole
Jurman, Giuseppe
Furlanello, Cesare
author_facet Chierici, Marco
Francescatto, Margherita
Bussola, Nicole
Jurman, Giuseppe
Furlanello, Cesare
author_sort Chierici, Marco
collection PubMed
description BACKGROUND: Drug-induced liver injury (DILI) is a major concern in drug development, as hepatotoxicity may not be apparent at early stages but can lead to life threatening consequences. The ability to predict DILI from in vitro data would be a crucial advantage. In 2018, the Critical Assessment Massive Data Analysis group proposed the CMap Drug Safety challenge focusing on DILI prediction. METHODS AND RESULTS: The challenge data included Affymetrix GeneChip expression profiles for the two cancer cell lines MCF7 and PC3 treated with 276 drug compounds and empty vehicles. Binary DILI labeling and a recommended train/test split for the development of predictive classification approaches were also provided. We devised three deep learning architectures for DILI prediction on the challenge data and compared them to random forest and multi-layer perceptron classifiers. On a subset of the data and for some of the models we additionally tested several strategies for balancing the two DILI classes and to identify alternative informative train/test splits. All the models were trained with the MAQC data analysis protocol (DAP), i.e., 10x5 cross-validation over the training set. In all the experiments, the classification performance in both cross-validation and external validation gave Matthews correlation coefficient (MCC) values below 0.2. We observed minimal differences between the two cell lines. Notably, deep learning approaches did not give an advantage on the classification performance. DISCUSSION: We extensively tested multiple machine learning approaches for the DILI classification task obtaining poor to mediocre performance. The results suggest that the CMap expression data on the two cell lines MCF7 and PC3 are not sufficient for accurate DILI label prediction. REVIEWERS: This article was reviewed by Maciej Kandula and Paweł P. Labaj.
format Online
Article
Text
id pubmed-7020573
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70205732020-02-20 Predictability of drug-induced liver injury by machine learning Chierici, Marco Francescatto, Margherita Bussola, Nicole Jurman, Giuseppe Furlanello, Cesare Biol Direct Research BACKGROUND: Drug-induced liver injury (DILI) is a major concern in drug development, as hepatotoxicity may not be apparent at early stages but can lead to life threatening consequences. The ability to predict DILI from in vitro data would be a crucial advantage. In 2018, the Critical Assessment Massive Data Analysis group proposed the CMap Drug Safety challenge focusing on DILI prediction. METHODS AND RESULTS: The challenge data included Affymetrix GeneChip expression profiles for the two cancer cell lines MCF7 and PC3 treated with 276 drug compounds and empty vehicles. Binary DILI labeling and a recommended train/test split for the development of predictive classification approaches were also provided. We devised three deep learning architectures for DILI prediction on the challenge data and compared them to random forest and multi-layer perceptron classifiers. On a subset of the data and for some of the models we additionally tested several strategies for balancing the two DILI classes and to identify alternative informative train/test splits. All the models were trained with the MAQC data analysis protocol (DAP), i.e., 10x5 cross-validation over the training set. In all the experiments, the classification performance in both cross-validation and external validation gave Matthews correlation coefficient (MCC) values below 0.2. We observed minimal differences between the two cell lines. Notably, deep learning approaches did not give an advantage on the classification performance. DISCUSSION: We extensively tested multiple machine learning approaches for the DILI classification task obtaining poor to mediocre performance. The results suggest that the CMap expression data on the two cell lines MCF7 and PC3 are not sufficient for accurate DILI label prediction. REVIEWERS: This article was reviewed by Maciej Kandula and Paweł P. Labaj. BioMed Central 2020-02-13 /pmc/articles/PMC7020573/ /pubmed/32054490 http://dx.doi.org/10.1186/s13062-020-0259-4 Text en © The Author(s) 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Chierici, Marco
Francescatto, Margherita
Bussola, Nicole
Jurman, Giuseppe
Furlanello, Cesare
Predictability of drug-induced liver injury by machine learning
title Predictability of drug-induced liver injury by machine learning
title_full Predictability of drug-induced liver injury by machine learning
title_fullStr Predictability of drug-induced liver injury by machine learning
title_full_unstemmed Predictability of drug-induced liver injury by machine learning
title_short Predictability of drug-induced liver injury by machine learning
title_sort predictability of drug-induced liver injury by machine learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7020573/
https://www.ncbi.nlm.nih.gov/pubmed/32054490
http://dx.doi.org/10.1186/s13062-020-0259-4
work_keys_str_mv AT chiericimarco predictabilityofdruginducedliverinjurybymachinelearning
AT francescattomargherita predictabilityofdruginducedliverinjurybymachinelearning
AT bussolanicole predictabilityofdruginducedliverinjurybymachinelearning
AT jurmangiuseppe predictabilityofdruginducedliverinjurybymachinelearning
AT furlanellocesare predictabilityofdruginducedliverinjurybymachinelearning