Cargando…
Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups
Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, whic...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008580/ https://www.ncbi.nlm.nih.gov/pubmed/36906642 http://dx.doi.org/10.1038/s41598-023-30986-1 |
_version_ | 1784905789273538560 |
---|---|
author | Merkelbach, Kilian Schaper, Steffen Diedrich, Christian Fritsch, Sebastian Johannes Schuppert, Andreas |
author_facet | Merkelbach, Kilian Schaper, Steffen Diedrich, Christian Fritsch, Sebastian Johannes Schuppert, Andreas |
author_sort | Merkelbach, Kilian |
collection | PubMed |
description | Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, which could eventually lead to better treatment through personalized medicine. Patient data derived from EHRs is heterogeneous and temporally irregular. Therefore, traditional machine learning methods like PCA are ill-suited for analysis of EHR-derived patient data. We propose to address these issues with a new methodology based on training a gated recurrent unit (GRU) autoencoder directly on health record data. Our method learns a low-dimensional feature space by training on patient data time series, where the time of each data point is expressed explicitly. We use positional encodings for time, allowing our model to better handle the temporal irregularity of the data. We apply our method to data from the Medical Information Mart for Intensive Care (MIMIC-III). Using our data-derived feature space, we can cluster patients into groups representing major classes of disease patterns. Additionally, we show that our feature space exhibits a rich substructure at multiple scales. |
format | Online Article Text |
id | pubmed-10008580 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-100085802023-03-13 Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups Merkelbach, Kilian Schaper, Steffen Diedrich, Christian Fritsch, Sebastian Johannes Schuppert, Andreas Sci Rep Article Electronic health records (EHRs) are used in hospitals to store diagnoses, clinician notes, examinations, lab results, and interventions for each patient. Grouping patients into distinct subsets, for example, via clustering, may enable the discovery of unknown disease patterns or comorbidities, which could eventually lead to better treatment through personalized medicine. Patient data derived from EHRs is heterogeneous and temporally irregular. Therefore, traditional machine learning methods like PCA are ill-suited for analysis of EHR-derived patient data. We propose to address these issues with a new methodology based on training a gated recurrent unit (GRU) autoencoder directly on health record data. Our method learns a low-dimensional feature space by training on patient data time series, where the time of each data point is expressed explicitly. We use positional encodings for time, allowing our model to better handle the temporal irregularity of the data. We apply our method to data from the Medical Information Mart for Intensive Care (MIMIC-III). Using our data-derived feature space, we can cluster patients into groups representing major classes of disease patterns. Additionally, we show that our feature space exhibits a rich substructure at multiple scales. Nature Publishing Group UK 2023-03-11 /pmc/articles/PMC10008580/ /pubmed/36906642 http://dx.doi.org/10.1038/s41598-023-30986-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Merkelbach, Kilian Schaper, Steffen Diedrich, Christian Fritsch, Sebastian Johannes Schuppert, Andreas Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups |
title | Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups |
title_full | Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups |
title_fullStr | Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups |
title_full_unstemmed | Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups |
title_short | Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups |
title_sort | novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of icu patient subgroups |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10008580/ https://www.ncbi.nlm.nih.gov/pubmed/36906642 http://dx.doi.org/10.1038/s41598-023-30986-1 |
work_keys_str_mv | AT merkelbachkilian novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups AT schapersteffen novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups AT diedrichchristian novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups AT fritschsebastianjohannes novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups AT schuppertandreas novelarchitectureforgatedrecurrentunitautoencodertrainedontimeseriesfromelectronichealthrecordsenablesdetectionoficupatientsubgroups |