Cargando…
EHR problem list clustering for improved topic-space navigation
BACKGROUND: The amount of patient-related information within clinical information systems accumulates over time, especially in cases where patients suffer from chronic diseases with many hospitalizations and consultations. The diagnosis or problem list is an important feature of the electronic healt...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448176/ https://www.ncbi.nlm.nih.gov/pubmed/30943968 http://dx.doi.org/10.1186/s12911-019-0789-9 |
_version_ | 1783408645225054208 |
---|---|
author | Kreuzthaler, Markus Pfeifer, Bastian Vera Ramos, Jose Antonio Kramer, Diether Grogger, Victor Bredenfeldt, Sylvia Pedevilla, Markus Krisper, Peter Schulz, Stefan |
author_facet | Kreuzthaler, Markus Pfeifer, Bastian Vera Ramos, Jose Antonio Kramer, Diether Grogger, Victor Bredenfeldt, Sylvia Pedevilla, Markus Krisper, Peter Schulz, Stefan |
author_sort | Kreuzthaler, Markus |
collection | PubMed |
description | BACKGROUND: The amount of patient-related information within clinical information systems accumulates over time, especially in cases where patients suffer from chronic diseases with many hospitalizations and consultations. The diagnosis or problem list is an important feature of the electronic health record, which provides a dynamic account of a patient’s current illness and past history. In the case of an Austrian hospital network, problem list entries are limited to fifty characters and are potentially linked to ICD-10. The requirement of producing ICD codes at each hospital stay, together with the length limitation of list items leads to highly redundant problem lists, which conflicts with the physicians’ need of getting a good overview of a patient in short time. This paper investigates a method, by which problem list items can be semantically grouped, in order to allow for fast navigation through patient-related topic spaces. METHODS: We applied a minimal language-dependent preprocessing strategy and mapped problem list entries as tf-idf weighted character 3-grams into a numerical vector space. Based on this representation we used the unweighted pair group method with arithmetic mean (UPGMA) clustering algorithm with cosine distances and inferred an optimal boundary in order to form semantically consistent topic spaces, taking into consideration different levels of dimensionality reduction via latent semantic analysis (LSA). RESULTS: With the proposed clustering approach, evaluated via an intra- and inter-patient scenario in combination with a natural language pipeline, we achieved an average compression rate of 80% of the initial list items forming consistent semantic topic spaces with an F-measure greater than 0.80 in both cases. The average number of identified topics in the intra-patient case (μ(Intra) = 78.4) was slightly lower than in the inter-patient case (μ(Inter) = 83.4). LSA-based feature space reduction had no significant positive performance impact in our investigations. CONCLUSIONS: The investigation presented here is centered on a data-driven solution to the known problem of information overload, which causes ineffective human-computer interactions at clinicians’ work places. This problem is addressed by navigable disease topic spaces where related items are grouped and the topics can be more easily accessed. |
format | Online Article Text |
id | pubmed-6448176 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-64481762019-04-15 EHR problem list clustering for improved topic-space navigation Kreuzthaler, Markus Pfeifer, Bastian Vera Ramos, Jose Antonio Kramer, Diether Grogger, Victor Bredenfeldt, Sylvia Pedevilla, Markus Krisper, Peter Schulz, Stefan BMC Med Inform Decis Mak Research BACKGROUND: The amount of patient-related information within clinical information systems accumulates over time, especially in cases where patients suffer from chronic diseases with many hospitalizations and consultations. The diagnosis or problem list is an important feature of the electronic health record, which provides a dynamic account of a patient’s current illness and past history. In the case of an Austrian hospital network, problem list entries are limited to fifty characters and are potentially linked to ICD-10. The requirement of producing ICD codes at each hospital stay, together with the length limitation of list items leads to highly redundant problem lists, which conflicts with the physicians’ need of getting a good overview of a patient in short time. This paper investigates a method, by which problem list items can be semantically grouped, in order to allow for fast navigation through patient-related topic spaces. METHODS: We applied a minimal language-dependent preprocessing strategy and mapped problem list entries as tf-idf weighted character 3-grams into a numerical vector space. Based on this representation we used the unweighted pair group method with arithmetic mean (UPGMA) clustering algorithm with cosine distances and inferred an optimal boundary in order to form semantically consistent topic spaces, taking into consideration different levels of dimensionality reduction via latent semantic analysis (LSA). RESULTS: With the proposed clustering approach, evaluated via an intra- and inter-patient scenario in combination with a natural language pipeline, we achieved an average compression rate of 80% of the initial list items forming consistent semantic topic spaces with an F-measure greater than 0.80 in both cases. The average number of identified topics in the intra-patient case (μ(Intra) = 78.4) was slightly lower than in the inter-patient case (μ(Inter) = 83.4). LSA-based feature space reduction had no significant positive performance impact in our investigations. CONCLUSIONS: The investigation presented here is centered on a data-driven solution to the known problem of information overload, which causes ineffective human-computer interactions at clinicians’ work places. This problem is addressed by navigable disease topic spaces where related items are grouped and the topics can be more easily accessed. BioMed Central 2019-04-04 /pmc/articles/PMC6448176/ /pubmed/30943968 http://dx.doi.org/10.1186/s12911-019-0789-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Kreuzthaler, Markus Pfeifer, Bastian Vera Ramos, Jose Antonio Kramer, Diether Grogger, Victor Bredenfeldt, Sylvia Pedevilla, Markus Krisper, Peter Schulz, Stefan EHR problem list clustering for improved topic-space navigation |
title | EHR problem list clustering for improved topic-space navigation |
title_full | EHR problem list clustering for improved topic-space navigation |
title_fullStr | EHR problem list clustering for improved topic-space navigation |
title_full_unstemmed | EHR problem list clustering for improved topic-space navigation |
title_short | EHR problem list clustering for improved topic-space navigation |
title_sort | ehr problem list clustering for improved topic-space navigation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6448176/ https://www.ncbi.nlm.nih.gov/pubmed/30943968 http://dx.doi.org/10.1186/s12911-019-0789-9 |
work_keys_str_mv | AT kreuzthalermarkus ehrproblemlistclusteringforimprovedtopicspacenavigation AT pfeiferbastian ehrproblemlistclusteringforimprovedtopicspacenavigation AT veraramosjoseantonio ehrproblemlistclusteringforimprovedtopicspacenavigation AT kramerdiether ehrproblemlistclusteringforimprovedtopicspacenavigation AT groggervictor ehrproblemlistclusteringforimprovedtopicspacenavigation AT bredenfeldtsylvia ehrproblemlistclusteringforimprovedtopicspacenavigation AT pedevillamarkus ehrproblemlistclusteringforimprovedtopicspacenavigation AT krisperpeter ehrproblemlistclusteringforimprovedtopicspacenavigation AT schulzstefan ehrproblemlistclusteringforimprovedtopicspacenavigation |