Cargando…

An interpretable risk prediction model for healthcare with pattern attention

BACKGROUND: The availability of massive amount of data enables the possibility of clinical predictive tasks. Deep learning methods have achieved promising performance on the tasks. However, most existing methods suffer from three limitations: (1) There are lots of missing value for real value events...

Descripción completa

Detalles Bibliográficos
Autores principales: Kamal, Sundreen Asad, Yin, Changchang, Qian, Buyue, Zhang, Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7772928/
https://www.ncbi.nlm.nih.gov/pubmed/33380322
http://dx.doi.org/10.1186/s12911-020-01331-7
_version_ 1783629966001307648
author Kamal, Sundreen Asad
Yin, Changchang
Qian, Buyue
Zhang, Ping
author_facet Kamal, Sundreen Asad
Yin, Changchang
Qian, Buyue
Zhang, Ping
author_sort Kamal, Sundreen Asad
collection PubMed
description BACKGROUND: The availability of massive amount of data enables the possibility of clinical predictive tasks. Deep learning methods have achieved promising performance on the tasks. However, most existing methods suffer from three limitations: (1) There are lots of missing value for real value events, many methods impute the missing value and then train their models based on the imputed values, which may introduce imputation bias. The models’ performance is highly dependent on the imputation accuracy. (2) Lots of existing studies just take Boolean value medical events (e.g. diagnosis code) as inputs, but ignore real value medical events (e.g., lab tests and vital signs), which are more important for acute disease (e.g., sepsis) and mortality prediction. (3) Existing interpretable models can illustrate which medical events are conducive to the output results, but are not able to give contributions of patterns among medical events. METHODS: In this study, we propose a novel interpretable Pattern Attention model with Value Embedding (PAVE) to predict the risks of certain diseases. PAVE takes the embedding of various medical events, their values and the corresponding occurring time as inputs, leverage self-attention mechanism to attend to meaningful patterns among medical events for risk prediction tasks. Because only the observed values are embedded into vectors, we don’t need to impute the missing values and thus avoids the imputations bias. Moreover, the self-attention mechanism is helpful for the model interpretability, which means the proposed model can output which patterns cause high risks. RESULTS: We conduct sepsis onset prediction and mortality prediction experiments on a publicly available dataset MIMIC-III and our proprietary EHR dataset. The experimental results show that PAVE outperforms existing models. Moreover, by analyzing the self-attention weights, our model outputs meaningful medical event patterns related to mortality. CONCLUSIONS: PAVE learns effective medical event representation by incorporating the values and occurring time, which can improve the risk prediction performance. Moreover, the presented self-attention mechanism can not only capture patients’ health state information, but also output the contributions of various medical event patterns, which pave the way for interpretable clinical risk predictions. AVAILABILITY: The code for this paper is available at: https://github.com/yinchangchang/PAVE.
format Online
Article
Text
id pubmed-7772928
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-77729282020-12-30 An interpretable risk prediction model for healthcare with pattern attention Kamal, Sundreen Asad Yin, Changchang Qian, Buyue Zhang, Ping BMC Med Inform Decis Mak Research BACKGROUND: The availability of massive amount of data enables the possibility of clinical predictive tasks. Deep learning methods have achieved promising performance on the tasks. However, most existing methods suffer from three limitations: (1) There are lots of missing value for real value events, many methods impute the missing value and then train their models based on the imputed values, which may introduce imputation bias. The models’ performance is highly dependent on the imputation accuracy. (2) Lots of existing studies just take Boolean value medical events (e.g. diagnosis code) as inputs, but ignore real value medical events (e.g., lab tests and vital signs), which are more important for acute disease (e.g., sepsis) and mortality prediction. (3) Existing interpretable models can illustrate which medical events are conducive to the output results, but are not able to give contributions of patterns among medical events. METHODS: In this study, we propose a novel interpretable Pattern Attention model with Value Embedding (PAVE) to predict the risks of certain diseases. PAVE takes the embedding of various medical events, their values and the corresponding occurring time as inputs, leverage self-attention mechanism to attend to meaningful patterns among medical events for risk prediction tasks. Because only the observed values are embedded into vectors, we don’t need to impute the missing values and thus avoids the imputations bias. Moreover, the self-attention mechanism is helpful for the model interpretability, which means the proposed model can output which patterns cause high risks. RESULTS: We conduct sepsis onset prediction and mortality prediction experiments on a publicly available dataset MIMIC-III and our proprietary EHR dataset. The experimental results show that PAVE outperforms existing models. Moreover, by analyzing the self-attention weights, our model outputs meaningful medical event patterns related to mortality. CONCLUSIONS: PAVE learns effective medical event representation by incorporating the values and occurring time, which can improve the risk prediction performance. Moreover, the presented self-attention mechanism can not only capture patients’ health state information, but also output the contributions of various medical event patterns, which pave the way for interpretable clinical risk predictions. AVAILABILITY: The code for this paper is available at: https://github.com/yinchangchang/PAVE. BioMed Central 2020-12-30 /pmc/articles/PMC7772928/ /pubmed/33380322 http://dx.doi.org/10.1186/s12911-020-01331-7 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Kamal, Sundreen Asad
Yin, Changchang
Qian, Buyue
Zhang, Ping
An interpretable risk prediction model for healthcare with pattern attention
title An interpretable risk prediction model for healthcare with pattern attention
title_full An interpretable risk prediction model for healthcare with pattern attention
title_fullStr An interpretable risk prediction model for healthcare with pattern attention
title_full_unstemmed An interpretable risk prediction model for healthcare with pattern attention
title_short An interpretable risk prediction model for healthcare with pattern attention
title_sort interpretable risk prediction model for healthcare with pattern attention
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7772928/
https://www.ncbi.nlm.nih.gov/pubmed/33380322
http://dx.doi.org/10.1186/s12911-020-01331-7
work_keys_str_mv AT kamalsundreenasad aninterpretableriskpredictionmodelforhealthcarewithpatternattention
AT yinchangchang aninterpretableriskpredictionmodelforhealthcarewithpatternattention
AT qianbuyue aninterpretableriskpredictionmodelforhealthcarewithpatternattention
AT zhangping aninterpretableriskpredictionmodelforhealthcarewithpatternattention
AT kamalsundreenasad interpretableriskpredictionmodelforhealthcarewithpatternattention
AT yinchangchang interpretableriskpredictionmodelforhealthcarewithpatternattention
AT qianbuyue interpretableriskpredictionmodelforhealthcarewithpatternattention
AT zhangping interpretableriskpredictionmodelforhealthcarewithpatternattention