Cargando…

A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records

BACKGROUND: Adverse drug events (ADEs) as well as other preventable adverse events in the hospital setting incur a yearly monetary cost of approximately $3.5 billion, in the United States alone. Therefore, it is of paramount importance to reduce the impact and prevalence of ADEs within the healthcar...

Descripción completa

Detalles Bibliográficos
Autores principales: Bagattini, Francesco, Karlsson, Isak, Rebane, Jonathan, Papapetrou, Panagiotis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327495/
https://www.ncbi.nlm.nih.gov/pubmed/30630486
http://dx.doi.org/10.1186/s12911-018-0717-4
_version_ 1783386478732115968
author Bagattini, Francesco
Karlsson, Isak
Rebane, Jonathan
Papapetrou, Panagiotis
author_facet Bagattini, Francesco
Karlsson, Isak
Rebane, Jonathan
Papapetrou, Panagiotis
author_sort Bagattini, Francesco
collection PubMed
description BACKGROUND: Adverse drug events (ADEs) as well as other preventable adverse events in the hospital setting incur a yearly monetary cost of approximately $3.5 billion, in the United States alone. Therefore, it is of paramount importance to reduce the impact and prevalence of ADEs within the healthcare sector, not only since it will result in reducing human suffering, but also as a means to substantially reduce economical strains on the healthcare system. One approach to mitigate this problem is to employ predictive models. While existing methods have been focusing on the exploitation of static features, limited attention has been given to temporal features. METHODS: In this paper, we present a novel classification framework for detecting ADEs in complex Electronic health records (EHRs) by exploiting the temporality and sparsity of the underlying features. The proposed framework consists of three phases for transforming sparse and multi-variate time series features into a single-valued feature representation, which can then be used by any classifier. Moreover, we propose and evaluate three different strategies for leveraging feature sparsity by incorporating it into the new representation. RESULTS: A large-scale evaluation on 15 ADE datasets extracted from a real-world EHR system shows that the proposed framework achieves significantly improved predictive performance compared to state-of-the-art. Moreover, our framework can reveal features that are clinically consistent with medical findings on ADE detection. CONCLUSIONS: Our study and experimental findings demonstrate that temporal multi-variate features of variable length and with high sparsity can be effectively utilized to predict ADEs from EHRs. Two key advantages of our framework are that it is method agnostic, i.e., versatile, and of low computational cost, i.e., fast; hence providing an important building block for future exploitation within the domain of machine learning from EHRs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-018-0717-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6327495
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63274952019-01-15 A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records Bagattini, Francesco Karlsson, Isak Rebane, Jonathan Papapetrou, Panagiotis BMC Med Inform Decis Mak Research Article BACKGROUND: Adverse drug events (ADEs) as well as other preventable adverse events in the hospital setting incur a yearly monetary cost of approximately $3.5 billion, in the United States alone. Therefore, it is of paramount importance to reduce the impact and prevalence of ADEs within the healthcare sector, not only since it will result in reducing human suffering, but also as a means to substantially reduce economical strains on the healthcare system. One approach to mitigate this problem is to employ predictive models. While existing methods have been focusing on the exploitation of static features, limited attention has been given to temporal features. METHODS: In this paper, we present a novel classification framework for detecting ADEs in complex Electronic health records (EHRs) by exploiting the temporality and sparsity of the underlying features. The proposed framework consists of three phases for transforming sparse and multi-variate time series features into a single-valued feature representation, which can then be used by any classifier. Moreover, we propose and evaluate three different strategies for leveraging feature sparsity by incorporating it into the new representation. RESULTS: A large-scale evaluation on 15 ADE datasets extracted from a real-world EHR system shows that the proposed framework achieves significantly improved predictive performance compared to state-of-the-art. Moreover, our framework can reveal features that are clinically consistent with medical findings on ADE detection. CONCLUSIONS: Our study and experimental findings demonstrate that temporal multi-variate features of variable length and with high sparsity can be effectively utilized to predict ADEs from EHRs. Two key advantages of our framework are that it is method agnostic, i.e., versatile, and of low computational cost, i.e., fast; hence providing an important building block for future exploitation within the domain of machine learning from EHRs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12911-018-0717-4) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-10 /pmc/articles/PMC6327495/ /pubmed/30630486 http://dx.doi.org/10.1186/s12911-018-0717-4 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Bagattini, Francesco
Karlsson, Isak
Rebane, Jonathan
Papapetrou, Panagiotis
A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
title A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
title_full A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
title_fullStr A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
title_full_unstemmed A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
title_short A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
title_sort classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327495/
https://www.ncbi.nlm.nih.gov/pubmed/30630486
http://dx.doi.org/10.1186/s12911-018-0717-4
work_keys_str_mv AT bagattinifrancesco aclassificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT karlssonisak aclassificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT rebanejonathan aclassificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT papapetroupanagiotis aclassificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT bagattinifrancesco classificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT karlssonisak classificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT rebanejonathan classificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords
AT papapetroupanagiotis classificationframeworkforexploitingsparsemultivariatetemporalfeatureswithapplicationtoadversedrugeventdetectioninmedicalrecords