Cargando…

Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset

Drug-induced liver injury (DILI) remains one of the challenges in the safety profile of both authorized and candidate drugs, and predicting hepatotoxicity from the chemical structure of a substance remains a task worth pursuing. Such an approach is coherent with the current tendency for replacing no...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ancuceanu, Robert, Hovanet, Marilena Viorica, Anghel, Adriana Iuliana, Furtunescu, Florentina, Neagu, Monica, Constantin, Carolina, Dinu, Mihaela
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7139829/ https://www.ncbi.nlm.nih.gov/pubmed/32204453 http://dx.doi.org/10.3390/ijms21062114

_version_	1783518856702066688
author	Ancuceanu, Robert Hovanet, Marilena Viorica Anghel, Adriana Iuliana Furtunescu, Florentina Neagu, Monica Constantin, Carolina Dinu, Mihaela
author_facet	Ancuceanu, Robert Hovanet, Marilena Viorica Anghel, Adriana Iuliana Furtunescu, Florentina Neagu, Monica Constantin, Carolina Dinu, Mihaela
author_sort	Ancuceanu, Robert
collection	PubMed
description	Drug-induced liver injury (DILI) remains one of the challenges in the safety profile of both authorized and candidate drugs, and predicting hepatotoxicity from the chemical structure of a substance remains a task worth pursuing. Such an approach is coherent with the current tendency for replacing non-clinical tests with in vitro or in silico alternatives. In 2016, a group of researchers from the FDA published an improved annotated list of drugs with respect to their DILI risk, constituting “the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans” (DILIrank). This paper is one of the few attempting to predict liver toxicity using the DILIrank dataset. Molecular descriptors were computed with the Dragon 7.0 software, and a variety of feature selection and machine learning algorithms were implemented in the R computing environment. Nested (double) cross-validation was used to externally validate the models selected. A total of 78 models with reasonable performance were selected and stacked through several approaches, including the building of multiple meta-models. The performance of the stacked models was slightly superior to other models published. The models were applied in a virtual screening exercise on over 100,000 compounds from the ZINC database and about 20% of them were predicted to be non-hepatotoxic.
format	Online Article Text
id	pubmed-7139829
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-71398292020-04-10 Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset Ancuceanu, Robert Hovanet, Marilena Viorica Anghel, Adriana Iuliana Furtunescu, Florentina Neagu, Monica Constantin, Carolina Dinu, Mihaela Int J Mol Sci Article Drug-induced liver injury (DILI) remains one of the challenges in the safety profile of both authorized and candidate drugs, and predicting hepatotoxicity from the chemical structure of a substance remains a task worth pursuing. Such an approach is coherent with the current tendency for replacing non-clinical tests with in vitro or in silico alternatives. In 2016, a group of researchers from the FDA published an improved annotated list of drugs with respect to their DILI risk, constituting “the largest reference drug list ranked by the risk for developing drug-induced liver injury in humans” (DILIrank). This paper is one of the few attempting to predict liver toxicity using the DILIrank dataset. Molecular descriptors were computed with the Dragon 7.0 software, and a variety of feature selection and machine learning algorithms were implemented in the R computing environment. Nested (double) cross-validation was used to externally validate the models selected. A total of 78 models with reasonable performance were selected and stacked through several approaches, including the building of multiple meta-models. The performance of the stacked models was slightly superior to other models published. The models were applied in a virtual screening exercise on over 100,000 compounds from the ZINC database and about 20% of them were predicted to be non-hepatotoxic. MDPI 2020-03-19 /pmc/articles/PMC7139829/ /pubmed/32204453 http://dx.doi.org/10.3390/ijms21062114 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ancuceanu, Robert Hovanet, Marilena Viorica Anghel, Adriana Iuliana Furtunescu, Florentina Neagu, Monica Constantin, Carolina Dinu, Mihaela Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_full	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_fullStr	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_full_unstemmed	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_short	Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset
title_sort	computational models using multiple machine learning algorithms for predicting drug hepatotoxicity with the dilirank dataset
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7139829/ https://www.ncbi.nlm.nih.gov/pubmed/32204453 http://dx.doi.org/10.3390/ijms21062114
work_keys_str_mv	AT ancuceanurobert computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT hovanetmarilenaviorica computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT angheladrianaiuliana computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT furtunescuflorentina computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT neagumonica computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT constantincarolina computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset AT dinumihaela computationalmodelsusingmultiplemachinelearningalgorithmsforpredictingdrughepatotoxicitywiththedilirankdataset

Computational Models Using Multiple Machine Learning Algorithms for Predicting Drug Hepatotoxicity with the DILIrank Dataset

Ejemplares similares