Cargando…

Representation learning for clinical time series prediction tasks in electronic health records

BACKGROUND: Electronic health records (EHRs) provide possibilities to improve patient care and facilitate clinical research. However, there are many challenges faced by the applications of EHRs, such as temporality, high dimensionality, sparseness, noise, random error and systematic bias. In particu...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruan, Tong, Lei, Liqi, Zhou, Yangming, Zhai, Jie, Zhang, Le, He, Ping, Gao, Ju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6916209/
https://www.ncbi.nlm.nih.gov/pubmed/31842854
http://dx.doi.org/10.1186/s12911-019-0985-7
_version_ 1783480185327190016
author Ruan, Tong
Lei, Liqi
Zhou, Yangming
Zhai, Jie
Zhang, Le
He, Ping
Gao, Ju
author_facet Ruan, Tong
Lei, Liqi
Zhou, Yangming
Zhai, Jie
Zhang, Le
He, Ping
Gao, Ju
author_sort Ruan, Tong
collection PubMed
description BACKGROUND: Electronic health records (EHRs) provide possibilities to improve patient care and facilitate clinical research. However, there are many challenges faced by the applications of EHRs, such as temporality, high dimensionality, sparseness, noise, random error and systematic bias. In particular, temporal information is difficult to effectively use by traditional machine learning methods while the sequential information of EHRs is very useful. METHOD: In this paper, we propose a general-purpose patient representation learning approach to summarize sequential EHRs. Specifically, a recurrent neural network based denoising autoencoder (RNN-DAE) is employed to encode inhospital records of each patient into a low dimensional dense vector. RESULTS: Based on EHR data collected from Shuguang Hospital affiliated to Shanghai University of Traditional Chinese Medicine, we experimentally evaluate our proposed RNN-DAE method on both mortality prediction task and comorbidity prediction task. Extensive experimental results show that our proposed RNN-DAE method outperforms existing methods. In addition, we apply the “Deep Feature” represented by our proposed RNN-DAE method to track similar patients with t-SNE, which also achieves some interesting observations. CONCLUSION: We propose an effective unsupervised RNN-DAE method to summarize patient sequential information in EHR data. Our proposed RNN-DAE method is useful on both mortality prediction task and comorbidity prediction task.
format Online
Article
Text
id pubmed-6916209
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69162092019-12-30 Representation learning for clinical time series prediction tasks in electronic health records Ruan, Tong Lei, Liqi Zhou, Yangming Zhai, Jie Zhang, Le He, Ping Gao, Ju BMC Med Inform Decis Mak Research BACKGROUND: Electronic health records (EHRs) provide possibilities to improve patient care and facilitate clinical research. However, there are many challenges faced by the applications of EHRs, such as temporality, high dimensionality, sparseness, noise, random error and systematic bias. In particular, temporal information is difficult to effectively use by traditional machine learning methods while the sequential information of EHRs is very useful. METHOD: In this paper, we propose a general-purpose patient representation learning approach to summarize sequential EHRs. Specifically, a recurrent neural network based denoising autoencoder (RNN-DAE) is employed to encode inhospital records of each patient into a low dimensional dense vector. RESULTS: Based on EHR data collected from Shuguang Hospital affiliated to Shanghai University of Traditional Chinese Medicine, we experimentally evaluate our proposed RNN-DAE method on both mortality prediction task and comorbidity prediction task. Extensive experimental results show that our proposed RNN-DAE method outperforms existing methods. In addition, we apply the “Deep Feature” represented by our proposed RNN-DAE method to track similar patients with t-SNE, which also achieves some interesting observations. CONCLUSION: We propose an effective unsupervised RNN-DAE method to summarize patient sequential information in EHR data. Our proposed RNN-DAE method is useful on both mortality prediction task and comorbidity prediction task. BioMed Central 2019-12-17 /pmc/articles/PMC6916209/ /pubmed/31842854 http://dx.doi.org/10.1186/s12911-019-0985-7 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ruan, Tong
Lei, Liqi
Zhou, Yangming
Zhai, Jie
Zhang, Le
He, Ping
Gao, Ju
Representation learning for clinical time series prediction tasks in electronic health records
title Representation learning for clinical time series prediction tasks in electronic health records
title_full Representation learning for clinical time series prediction tasks in electronic health records
title_fullStr Representation learning for clinical time series prediction tasks in electronic health records
title_full_unstemmed Representation learning for clinical time series prediction tasks in electronic health records
title_short Representation learning for clinical time series prediction tasks in electronic health records
title_sort representation learning for clinical time series prediction tasks in electronic health records
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6916209/
https://www.ncbi.nlm.nih.gov/pubmed/31842854
http://dx.doi.org/10.1186/s12911-019-0985-7
work_keys_str_mv AT ruantong representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords
AT leiliqi representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords
AT zhouyangming representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords
AT zhaijie representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords
AT zhangle representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords
AT heping representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords
AT gaoju representationlearningforclinicaltimeseriespredictiontasksinelectronichealthrecords