Cargando…

Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals

BACKGROUND: The importance of identifying and evaluating adverse drug reactions (ADRs) has been widely recognized. Many studies have developed algorithms for ADR signal detection using electronic health record (EHR) data. In this study, we propose a machine learning (ML) model that enables accurate...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeong, Eugene, Park, Namgi, Choi, Young, Park, Rae Woong, Yoon, Dukyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248973/
https://www.ncbi.nlm.nih.gov/pubmed/30462745
http://dx.doi.org/10.1371/journal.pone.0207749
_version_ 1783372663528357888
author Jeong, Eugene
Park, Namgi
Choi, Young
Park, Rae Woong
Yoon, Dukyong
author_facet Jeong, Eugene
Park, Namgi
Choi, Young
Park, Rae Woong
Yoon, Dukyong
author_sort Jeong, Eugene
collection PubMed
description BACKGROUND: The importance of identifying and evaluating adverse drug reactions (ADRs) has been widely recognized. Many studies have developed algorithms for ADR signal detection using electronic health record (EHR) data. In this study, we propose a machine learning (ML) model that enables accurate ADR signal detection by integrating features from existing algorithms based on inpatient EHR laboratory results. MATERIALS AND METHODS: To construct an ADR reference dataset, we extracted known drug–laboratory event pairs represented by a laboratory test from the EU-SPC and SIDER databases. All possible drug–laboratory event pairs, except known ones, are considered unknown. To detect a known drug–laboratory event pair, three existing algorithms—CERT, CLEAR, and PACE—were applied to 21-year inpatient EHR data. We also constructed ML models (based on random forest, L1 regularized logistic regression, support vector machine, and a neural network) that use the intermediate products of the CERT, CLEAR, and PACE algorithms as inputs and determine whether a drug–laboratory event pair is associated. For performance comparison, we evaluated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1-measure, and area under receiver operating characteristic (AUROC). RESULTS: All measures of ML models outperformed those of existing algorithms with sensitivity of 0.593–0.793, specificity of 0.619–0.796, NPV of 0.645–0.727, PPV of 0.680–0.777, F1-measure of 0.629–0.709, and AUROC of 0.737–0.816. Features related to change or distribution of shape were considered important for detecting ADR signals. CONCLUSIONS: Improved performance of ML models indicated that applying our model to EHR data is feasible and promising for detecting more accurate and comprehensive ADR signals.
format Online
Article
Text
id pubmed-6248973
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62489732018-12-06 Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals Jeong, Eugene Park, Namgi Choi, Young Park, Rae Woong Yoon, Dukyong PLoS One Research Article BACKGROUND: The importance of identifying and evaluating adverse drug reactions (ADRs) has been widely recognized. Many studies have developed algorithms for ADR signal detection using electronic health record (EHR) data. In this study, we propose a machine learning (ML) model that enables accurate ADR signal detection by integrating features from existing algorithms based on inpatient EHR laboratory results. MATERIALS AND METHODS: To construct an ADR reference dataset, we extracted known drug–laboratory event pairs represented by a laboratory test from the EU-SPC and SIDER databases. All possible drug–laboratory event pairs, except known ones, are considered unknown. To detect a known drug–laboratory event pair, three existing algorithms—CERT, CLEAR, and PACE—were applied to 21-year inpatient EHR data. We also constructed ML models (based on random forest, L1 regularized logistic regression, support vector machine, and a neural network) that use the intermediate products of the CERT, CLEAR, and PACE algorithms as inputs and determine whether a drug–laboratory event pair is associated. For performance comparison, we evaluated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), F1-measure, and area under receiver operating characteristic (AUROC). RESULTS: All measures of ML models outperformed those of existing algorithms with sensitivity of 0.593–0.793, specificity of 0.619–0.796, NPV of 0.645–0.727, PPV of 0.680–0.777, F1-measure of 0.629–0.709, and AUROC of 0.737–0.816. Features related to change or distribution of shape were considered important for detecting ADR signals. CONCLUSIONS: Improved performance of ML models indicated that applying our model to EHR data is feasible and promising for detecting more accurate and comprehensive ADR signals. Public Library of Science 2018-11-21 /pmc/articles/PMC6248973/ /pubmed/30462745 http://dx.doi.org/10.1371/journal.pone.0207749 Text en © 2018 Jeong et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jeong, Eugene
Park, Namgi
Choi, Young
Park, Rae Woong
Yoon, Dukyong
Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
title Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
title_full Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
title_fullStr Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
title_full_unstemmed Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
title_short Machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
title_sort machine learning model combining features from algorithms with different analytical methodologies to detect laboratory-event-related adverse drug reaction signals
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248973/
https://www.ncbi.nlm.nih.gov/pubmed/30462745
http://dx.doi.org/10.1371/journal.pone.0207749
work_keys_str_mv AT jeongeugene machinelearningmodelcombiningfeaturesfromalgorithmswithdifferentanalyticalmethodologiestodetectlaboratoryeventrelatedadversedrugreactionsignals
AT parknamgi machinelearningmodelcombiningfeaturesfromalgorithmswithdifferentanalyticalmethodologiestodetectlaboratoryeventrelatedadversedrugreactionsignals
AT choiyoung machinelearningmodelcombiningfeaturesfromalgorithmswithdifferentanalyticalmethodologiestodetectlaboratoryeventrelatedadversedrugreactionsignals
AT parkraewoong machinelearningmodelcombiningfeaturesfromalgorithmswithdifferentanalyticalmethodologiestodetectlaboratoryeventrelatedadversedrugreactionsignals
AT yoondukyong machinelearningmodelcombiningfeaturesfromalgorithmswithdifferentanalyticalmethodologiestodetectlaboratoryeventrelatedadversedrugreactionsignals