Cargando…

Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups

Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Merkelbach, Kilian, Schaper, Steffen, Diedrich, Christian, Fritsch, Sebastian Johannes, Schuppert, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008580/
https://www.ncbi.nlm.nih.gov/pubmed/36906642
http://dx.doi.org/10.1038/s41598-023-30986-1
_version_ 1784905789273538560
author Merkelbach, Kilian
Schaper, Steffen
Diedrich, Christian
Fritsch, Sebastian Johannes
Schuppert, Andreas
author_facet Merkelbach, Kilian
Schaper, Steffen
Diedrich, Christian
Fritsch, Sebastian Johannes
Schuppert, Andreas
author_sort Merkelbach, Kilian
collection PubMed
description Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, which could eventually lead to better treatment through personalized medicine. Patient data derived from EHRs is heterogeneous and temporally irregular. Therefore, traditional machine learning methods like PCA are ill-suited for analysis of EHR-derived patient data. We propose to address these issues with a new methodology based on training a gated recurrent unit (GRU) autoencoder directly on health record data. Our method learns a low-dimensional feature space by training on patient data time series, where the time of each data point is expressed explicitly. We use positional encodings for time, allowing our model to better handle the temporal irregularity of the data. We apply our method to data from the Medical Information Mart for Intensive Care (MIMIC-III). Using our data-derived feature space, we can cluster patients into groups representing major classes of disease patterns. Additionally, we show that our feature space exhibits a rich substructure at multiple scales.
format Online
Article
Text
id pubmed-10008580
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100085802023-03-13 Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups Merkelbach, Kilian Schaper, Steffen Diedrich, Christian Fritsch, Sebastian Johannes Schuppert, Andreas Sci Rep Article Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, which could eventually lead to better treatment through personalized medicine. Patient data derived from EHRs is heterogeneous and temporally irregular. Therefore, traditional machine learning methods like PCA are ill-suited for analysis of EHR-derived patient data. We propose to address these issues with a new methodology based on training a gated recurrent unit (GRU) autoencoder directly on health record data. Our method learns a low-dimensional feature space by training on patient data time series, where the time of each data point is expressed explicitly. We use positional encodings for time, allowing our model to better handle the temporal irregularity of the data. We apply our method to data from the Medical Information Mart for Intensive Care (MIMIC-III). Using our data-derived feature space, we can cluster patients into groups representing major classes of disease patterns. Additionally, we show that our feature space exhibits a rich substructure at multiple scales. Nature Publishing Group UK 2023-03-11 /pmc/articles/PMC10008580/ /pubmed/36906642 http://dx.doi.org/10.1038/s41598-023-30986-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Merkelbach, Kilian
Schaper, Steffen
Diedrich, Christian
Fritsch, Sebastian Johannes
Schuppert, Andreas
Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
title Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
title_full Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
title_fullStr Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
title_full_unstemmed Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
title_short Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
title_sort novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of icu patient subgroups
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008580/
https://www.ncbi.nlm.nih.gov/pubmed/36906642
http://dx.doi.org/10.1038/s41598-023-30986-1
work_keys_str_mv AT merkelbachkilian novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups
AT schapersteffen novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups
AT diedrichchristian novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups
AT fritschsebastianjohannes novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups
AT schuppertandreas novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups