Cargando…
A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes
A large number of clinical concepts are categorized under standardized formats that ease the manipulation, understanding, analysis, and exchange of information. One of the most extended codifications is the International Classification of Diseases (ICD) used for characterizing diagnoses and clinical...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8049127/ https://www.ncbi.nlm.nih.gov/pubmed/33954230 http://dx.doi.org/10.7717/peerj-cs.430 |
_version_ | 1783679368699052032 |
---|---|
author | Alcaide, Daniel Aerts, Jan |
author_facet | Alcaide, Daniel Aerts, Jan |
author_sort | Alcaide, Daniel |
collection | PubMed |
description | A large number of clinical concepts are categorized under standardized formats that ease the manipulation, understanding, analysis, and exchange of information. One of the most extended codifications is the International Classification of Diseases (ICD) used for characterizing diagnoses and clinical procedures. With formatted ICD concepts, a patient profile can be described through a set of standardized and sorted attributes according to the relevance or chronology of events. This structured data is fundamental to quantify the similarity between patients and detect relevant clinical characteristics. Data visualization tools allow the representation and comprehension of data patterns, usually of a high dimensional nature, where only a partial picture can be projected. In this paper, we provide a visual analytics approach for the identification of homogeneous patient cohorts by combining custom distance metrics with a flexible dimensionality reduction technique. First we define a new metric to measure the similarity between diagnosis profiles through the concordance and relevance of events. Second we describe a variation of the Simplified Topological Abstraction of Data (STAD) dimensionality reduction technique to enhance the projection of signals preserving the global structure of data. The MIMIC-III clinical database is used for implementing the analysis into an interactive dashboard, providing a highly expressive environment for the exploration and comparison of patients groups with at least one identical diagnostic ICD code. The combination of the distance metric and STAD not only allows the identification of patterns but also provides a new layer of information to establish additional relationships between patient cohorts. The method and tool presented here add a valuable new approach for exploring heterogeneous patient populations. In addition, the distance metric described can be applied in other domains that employ ordered lists of categorical data. |
format | Online Article Text |
id | pubmed-8049127 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80491272021-05-04 A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes Alcaide, Daniel Aerts, Jan PeerJ Comput Sci Data Science A large number of clinical concepts are categorized under standardized formats that ease the manipulation, understanding, analysis, and exchange of information. One of the most extended codifications is the International Classification of Diseases (ICD) used for characterizing diagnoses and clinical procedures. With formatted ICD concepts, a patient profile can be described through a set of standardized and sorted attributes according to the relevance or chronology of events. This structured data is fundamental to quantify the similarity between patients and detect relevant clinical characteristics. Data visualization tools allow the representation and comprehension of data patterns, usually of a high dimensional nature, where only a partial picture can be projected. In this paper, we provide a visual analytics approach for the identification of homogeneous patient cohorts by combining custom distance metrics with a flexible dimensionality reduction technique. First we define a new metric to measure the similarity between diagnosis profiles through the concordance and relevance of events. Second we describe a variation of the Simplified Topological Abstraction of Data (STAD) dimensionality reduction technique to enhance the projection of signals preserving the global structure of data. The MIMIC-III clinical database is used for implementing the analysis into an interactive dashboard, providing a highly expressive environment for the exploration and comparison of patients groups with at least one identical diagnostic ICD code. The combination of the distance metric and STAD not only allows the identification of patterns but also provides a new layer of information to establish additional relationships between patient cohorts. The method and tool presented here add a valuable new approach for exploring heterogeneous patient populations. In addition, the distance metric described can be applied in other domains that employ ordered lists of categorical data. PeerJ Inc. 2021-04-06 /pmc/articles/PMC8049127/ /pubmed/33954230 http://dx.doi.org/10.7717/peerj-cs.430 Text en © 2021 Alcaide and Aerts https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Data Science Alcaide, Daniel Aerts, Jan A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes |
title | A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes |
title_full | A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes |
title_fullStr | A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes |
title_full_unstemmed | A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes |
title_short | A visual analytic approach for the identification of ICU patient subpopulations using ICD diagnostic codes |
title_sort | visual analytic approach for the identification of icu patient subpopulations using icd diagnostic codes |
topic | Data Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8049127/ https://www.ncbi.nlm.nih.gov/pubmed/33954230 http://dx.doi.org/10.7717/peerj-cs.430 |
work_keys_str_mv | AT alcaidedaniel avisualanalyticapproachfortheidentificationoficupatientsubpopulationsusingicddiagnosticcodes AT aertsjan avisualanalyticapproachfortheidentificationoficupatientsubpopulationsusingicddiagnosticcodes AT alcaidedaniel visualanalyticapproachfortheidentificationoficupatientsubpopulationsusingicddiagnosticcodes AT aertsjan visualanalyticapproachfortheidentificationoficupatientsubpopulationsusingicddiagnosticcodes |