Cargando…

EHR problem list clustering for improved topic-space navigation

BACKGROUND: The amount of patient-related information within clinical information systems accumulates over time, especially in cases where patients suffer from chronic diseases with many hospitalizations and consultations. The diagnosis or problem list is an important feature of the electronic healt...

Descripción completa

Detalles Bibliográficos
Autores principales: Kreuzthaler, Markus, Pfeifer, Bastian, Vera Ramos, Jose Antonio, Kramer, Diether, Grogger, Victor, Bredenfeldt, Sylvia, Pedevilla, Markus, Krisper, Peter, Schulz, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448176/
https://www.ncbi.nlm.nih.gov/pubmed/30943968
http://dx.doi.org/10.1186/s12911-019-0789-9
_version_ 1783408645225054208
author Kreuzthaler, Markus
Pfeifer, Bastian
Vera Ramos, Jose Antonio
Kramer, Diether
Grogger, Victor
Bredenfeldt, Sylvia
Pedevilla, Markus
Krisper, Peter
Schulz, Stefan
author_facet Kreuzthaler, Markus
Pfeifer, Bastian
Vera Ramos, Jose Antonio
Kramer, Diether
Grogger, Victor
Bredenfeldt, Sylvia
Pedevilla, Markus
Krisper, Peter
Schulz, Stefan
author_sort Kreuzthaler, Markus
collection PubMed
description BACKGROUND: The amount of patient-related information within clinical information systems accumulates over time, especially in cases where patients suffer from chronic diseases with many hospitalizations and consultations. The diagnosis or problem list is an important feature of the electronic health record, which provides a dynamic account of a patient’s current illness and past history. In the case of an Austrian hospital network, problem list entries are limited to fifty characters and are potentially linked to ICD-10. The requirement of producing ICD codes at each hospital stay, together with the length limitation of list items leads to highly redundant problem lists, which conflicts with the physicians’ need of getting a good overview of a patient in short time. This paper investigates a method, by which problem list items can be semantically grouped, in order to allow for fast navigation through patient-related topic spaces. METHODS: We applied a minimal language-dependent preprocessing strategy and mapped problem list entries as tf-idf weighted character 3-grams into a numerical vector space. Based on this representation we used the unweighted pair group method with arithmetic mean (UPGMA) clustering algorithm with cosine distances and inferred an optimal boundary in order to form semantically consistent topic spaces, taking into consideration different levels of dimensionality reduction via latent semantic analysis (LSA). RESULTS: With the proposed clustering approach, evaluated via an intra- and inter-patient scenario in combination with a natural language pipeline, we achieved an average compression rate of 80% of the initial list items forming consistent semantic topic spaces with an F-measure greater than 0.80 in both cases. The average number of identified topics in the intra-patient case (μ(Intra) = 78.4) was slightly lower than in the inter-patient case (μ(Inter) = 83.4). LSA-based feature space reduction had no significant positive performance impact in our investigations. CONCLUSIONS: The investigation presented here is centered on a data-driven solution to the known problem of information overload, which causes ineffective human-computer interactions at clinicians’ work places. This problem is addressed by navigable disease topic spaces where related items are grouped and the topics can be more easily accessed.
format Online
Article
Text
id pubmed-6448176
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64481762019-04-15 EHR problem list clustering for improved topic-space navigation Kreuzthaler, Markus Pfeifer, Bastian Vera Ramos, Jose Antonio Kramer, Diether Grogger, Victor Bredenfeldt, Sylvia Pedevilla, Markus Krisper, Peter Schulz, Stefan BMC Med Inform Decis Mak Research BACKGROUND: The amount of patient-related information within clinical information systems accumulates over time, especially in cases where patients suffer from chronic diseases with many hospitalizations and consultations. The diagnosis or problem list is an important feature of the electronic health record, which provides a dynamic account of a patient’s current illness and past history. In the case of an Austrian hospital network, problem list entries are limited to fifty characters and are potentially linked to ICD-10. The requirement of producing ICD codes at each hospital stay, together with the length limitation of list items leads to highly redundant problem lists, which conflicts with the physicians’ need of getting a good overview of a patient in short time. This paper investigates a method, by which problem list items can be semantically grouped, in order to allow for fast navigation through patient-related topic spaces. METHODS: We applied a minimal language-dependent preprocessing strategy and mapped problem list entries as tf-idf weighted character 3-grams into a numerical vector space. Based on this representation we used the unweighted pair group method with arithmetic mean (UPGMA) clustering algorithm with cosine distances and inferred an optimal boundary in order to form semantically consistent topic spaces, taking into consideration different levels of dimensionality reduction via latent semantic analysis (LSA). RESULTS: With the proposed clustering approach, evaluated via an intra- and inter-patient scenario in combination with a natural language pipeline, we achieved an average compression rate of 80% of the initial list items forming consistent semantic topic spaces with an F-measure greater than 0.80 in both cases. The average number of identified topics in the intra-patient case (μ(Intra) = 78.4) was slightly lower than in the inter-patient case (μ(Inter) = 83.4). LSA-based feature space reduction had no significant positive performance impact in our investigations. CONCLUSIONS: The investigation presented here is centered on a data-driven solution to the known problem of information overload, which causes ineffective human-computer interactions at clinicians’ work places. This problem is addressed by navigable disease topic spaces where related items are grouped and the topics can be more easily accessed. BioMed Central 2019-04-04 /pmc/articles/PMC6448176/ /pubmed/30943968 http://dx.doi.org/10.1186/s12911-019-0789-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Kreuzthaler, Markus
Pfeifer, Bastian
Vera Ramos, Jose Antonio
Kramer, Diether
Grogger, Victor
Bredenfeldt, Sylvia
Pedevilla, Markus
Krisper, Peter
Schulz, Stefan
EHR problem list clustering for improved topic-space navigation
title EHR problem list clustering for improved topic-space navigation
title_full EHR problem list clustering for improved topic-space navigation
title_fullStr EHR problem list clustering for improved topic-space navigation
title_full_unstemmed EHR problem list clustering for improved topic-space navigation
title_short EHR problem list clustering for improved topic-space navigation
title_sort ehr problem list clustering for improved topic-space navigation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448176/
https://www.ncbi.nlm.nih.gov/pubmed/30943968
http://dx.doi.org/10.1186/s12911-019-0789-9
work_keys_str_mv AT kreuzthalermarkus ehrproblemlistclusteringforimprovedtopicspacenavigation
AT pfeiferbastian ehrproblemlistclusteringforimprovedtopicspacenavigation
AT veraramosjoseantonio ehrproblemlistclusteringforimprovedtopicspacenavigation
AT kramerdiether ehrproblemlistclusteringforimprovedtopicspacenavigation
AT groggervictor ehrproblemlistclusteringforimprovedtopicspacenavigation
AT bredenfeldtsylvia ehrproblemlistclusteringforimprovedtopicspacenavigation
AT pedevillamarkus ehrproblemlistclusteringforimprovedtopicspacenavigation
AT krisperpeter ehrproblemlistclusteringforimprovedtopicspacenavigation
AT schulzstefan ehrproblemlistclusteringforimprovedtopicspacenavigation