Cargando…

DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature

Drug-induced liver injury (DILI) is a class of adverse drug reactions (ADR) that causes problems in both clinical and research settings. It is the most frequent cause of acute liver failure in the majority of Western countries and is a major cause of attrition of novel drug candidates. Manual trawli...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rathee, Sanjay, MacMahon, Meabh, Liu, Anika, Katritsis, Nicholas M., Youssef, Gehad, Hwang, Woochang, Wollman, Lilly, Han, Namshik
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9277181/ https://www.ncbi.nlm.nih.gov/pubmed/35846129 http://dx.doi.org/10.3389/fgene.2022.867946

_version_	1784745898719313920
author	Rathee, Sanjay MacMahon, Meabh Liu, Anika Katritsis, Nicholas M. Youssef, Gehad Hwang, Woochang Wollman, Lilly Han, Namshik
author_facet	Rathee, Sanjay MacMahon, Meabh Liu, Anika Katritsis, Nicholas M. Youssef, Gehad Hwang, Woochang Wollman, Lilly Han, Namshik
author_sort	Rathee, Sanjay
collection	PubMed
description	Drug-induced liver injury (DILI) is a class of adverse drug reactions (ADR) that causes problems in both clinical and research settings. It is the most frequent cause of acute liver failure in the majority of Western countries and is a major cause of attrition of novel drug candidates. Manual trawling of the literature is the main route of deriving information on DILI from research studies. This makes it an inefficient process prone to human error. Therefore, an automatized AI model capable of retrieving DILI-related articles from the huge ocean of literature could be invaluable for the drug discovery community. In this study, we built an artificial intelligence (AI) model combining the power of natural language processing (NLP) and machine learning (ML) to address this problem. This model uses NLP to filter out meaningless text (e.g., stop words) and uses customized functions to extract relevant keywords such as singleton, pair, and triplet. These keywords are processed by an apriori pattern mining algorithm to extract relevant patterns which are used to estimate initial weightings for a ML classifier. Along with pattern importance and frequency, an FDA-approved drug list mentioning DILI adds extra confidence in classification. The combined power of these methods builds a DILI classifier (DILI( C )), with 94.91% cross-validation and 94.14% external validation accuracy. To make DILI( C ) as accessible as possible, including to researchers without coding experience, an R Shiny app capable of classifying single or multiple entries for DILI is developed to enhance ease of user experience and made available at https://researchmind.co.uk/diliclassifier/. Additionally, a GitHub link (https://github.com/sanjaysinghrathi/DILI-Classifier) for app source code and ISMB extended video talk (https://www.youtube.com/watch?v=j305yIVi_f8) are available as supplementary materials.
format	Online Article Text
id	pubmed-9277181
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-92771812022-07-14 DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature Rathee, Sanjay MacMahon, Meabh Liu, Anika Katritsis, Nicholas M. Youssef, Gehad Hwang, Woochang Wollman, Lilly Han, Namshik Front Genet Genetics Drug-induced liver injury (DILI) is a class of adverse drug reactions (ADR) that causes problems in both clinical and research settings. It is the most frequent cause of acute liver failure in the majority of Western countries and is a major cause of attrition of novel drug candidates. Manual trawling of the literature is the main route of deriving information on DILI from research studies. This makes it an inefficient process prone to human error. Therefore, an automatized AI model capable of retrieving DILI-related articles from the huge ocean of literature could be invaluable for the drug discovery community. In this study, we built an artificial intelligence (AI) model combining the power of natural language processing (NLP) and machine learning (ML) to address this problem. This model uses NLP to filter out meaningless text (e.g., stop words) and uses customized functions to extract relevant keywords such as singleton, pair, and triplet. These keywords are processed by an apriori pattern mining algorithm to extract relevant patterns which are used to estimate initial weightings for a ML classifier. Along with pattern importance and frequency, an FDA-approved drug list mentioning DILI adds extra confidence in classification. The combined power of these methods builds a DILI classifier (DILI( C )), with 94.91% cross-validation and 94.14% external validation accuracy. To make DILI( C ) as accessible as possible, including to researchers without coding experience, an R Shiny app capable of classifying single or multiple entries for DILI is developed to enhance ease of user experience and made available at https://researchmind.co.uk/diliclassifier/. Additionally, a GitHub link (https://github.com/sanjaysinghrathi/DILI-Classifier) for app source code and ISMB extended video talk (https://www.youtube.com/watch?v=j305yIVi_f8) are available as supplementary materials. Frontiers Media S.A. 2022-06-29 /pmc/articles/PMC9277181/ /pubmed/35846129 http://dx.doi.org/10.3389/fgene.2022.867946 Text en Copyright © 2022 Rathee, MacMahon, Liu, Katritsis, Youssef, Hwang, Wollman and Han. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Rathee, Sanjay MacMahon, Meabh Liu, Anika Katritsis, Nicholas M. Youssef, Gehad Hwang, Woochang Wollman, Lilly Han, Namshik DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature
title	DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature
title_full	DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature
title_fullStr	DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature
title_full_unstemmed	DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature
title_short	DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature
title_sort	dili( c ): an ai-based classifier to search for drug-induced liver injury literature
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9277181/ https://www.ncbi.nlm.nih.gov/pubmed/35846129 http://dx.doi.org/10.3389/fgene.2022.867946
work_keys_str_mv	AT ratheesanjay dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT macmahonmeabh dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT liuanika dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT katritsisnicholasm dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT youssefgehad dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT hwangwoochang dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT wollmanlilly dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature AT hannamshik dilicanaibasedclassifiertosearchfordruginducedliverinjuryliterature

DILI( C ): An AI-Based Classifier to Search for Drug-Induced Liver Injury Literature

Ejemplares similares