Cargando…

Machine learning models for classification tasks related to drug safety

In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015–2021). The study focuses only on classification models with large dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Rácz, Anita, Bajusz, Dávid, Miranda-Quintana, Ramón Alain, Héberger, Károly
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8342376/
https://www.ncbi.nlm.nih.gov/pubmed/34110577
http://dx.doi.org/10.1007/s11030-021-10239-x
_version_ 1783734055418724352
author Rácz, Anita
Bajusz, Dávid
Miranda-Quintana, Ramón Alain
Héberger, Károly
author_facet Rácz, Anita
Bajusz, Dávid
Miranda-Quintana, Ramón Alain
Héberger, Károly
author_sort Rácz, Anita
collection PubMed
description In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015–2021). The study focuses only on classification models with large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis was carried out for nine different targets: hERG-mediated cardiotoxicity, blood–brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The comparison of the best classification models was targeted to reveal the differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and the different validation protocols. Based on the evaluation of the data, we can say that tree-based algorithms are (still) dominating the field, with consensus modeling being an increasing trend in drug safety predictions. Although one can already find classification models with great performances to hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets are still central to ADMET-related research efforts. [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11030-021-10239-x.
format Online
Article
Text
id pubmed-8342376
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-83423762021-08-20 Machine learning models for classification tasks related to drug safety Rácz, Anita Bajusz, Dávid Miranda-Quintana, Ramón Alain Héberger, Károly Mol Divers Original Article In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015–2021). The study focuses only on classification models with large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis was carried out for nine different targets: hERG-mediated cardiotoxicity, blood–brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The comparison of the best classification models was targeted to reveal the differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and the different validation protocols. Based on the evaluation of the data, we can say that tree-based algorithms are (still) dominating the field, with consensus modeling being an increasing trend in drug safety predictions. Although one can already find classification models with great performances to hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets are still central to ADMET-related research efforts. [Image: see text] SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s11030-021-10239-x. Springer International Publishing 2021-06-10 2021 /pmc/articles/PMC8342376/ /pubmed/34110577 http://dx.doi.org/10.1007/s11030-021-10239-x Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Rácz, Anita
Bajusz, Dávid
Miranda-Quintana, Ramón Alain
Héberger, Károly
Machine learning models for classification tasks related to drug safety
title Machine learning models for classification tasks related to drug safety
title_full Machine learning models for classification tasks related to drug safety
title_fullStr Machine learning models for classification tasks related to drug safety
title_full_unstemmed Machine learning models for classification tasks related to drug safety
title_short Machine learning models for classification tasks related to drug safety
title_sort machine learning models for classification tasks related to drug safety
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8342376/
https://www.ncbi.nlm.nih.gov/pubmed/34110577
http://dx.doi.org/10.1007/s11030-021-10239-x
work_keys_str_mv AT raczanita machinelearningmodelsforclassificationtasksrelatedtodrugsafety
AT bajuszdavid machinelearningmodelsforclassificationtasksrelatedtodrugsafety
AT mirandaquintanaramonalain machinelearningmodelsforclassificationtasksrelatedtodrugsafety
AT hebergerkaroly machinelearningmodelsforclassificationtasksrelatedtodrugsafety