Cargando…

A richly interactive exploratory data analysis and visualization tool using electronic medical records

BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasin...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Chih-Wei, Lu, Richard, Iqbal, Usman, Lin, Shen-Hsien, Nguyen, Phung Anh (Alex), Yang, Hsuan-Chia, Wang, Chun-Fu, Li, Jianping, Ma, Kwan-Liu, Li, Yu-Chuan (Jack), Jian, Wen-Shan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4643519/
https://www.ncbi.nlm.nih.gov/pubmed/26563282
http://dx.doi.org/10.1186/s12911-015-0218-7
_version_ 1782400528522674176
author Huang, Chih-Wei
Lu, Richard
Iqbal, Usman
Lin, Shen-Hsien
Nguyen, Phung Anh (Alex)
Yang, Hsuan-Chia
Wang, Chun-Fu
Li, Jianping
Ma, Kwan-Liu
Li, Yu-Chuan (Jack)
Jian, Wen-Shan
author_facet Huang, Chih-Wei
Lu, Richard
Iqbal, Usman
Lin, Shen-Hsien
Nguyen, Phung Anh (Alex)
Yang, Hsuan-Chia
Wang, Chun-Fu
Li, Jianping
Ma, Kwan-Liu
Li, Yu-Chuan (Jack)
Jian, Wen-Shan
author_sort Huang, Chih-Wei
collection PubMed
description BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasingly important problem to solve. METHODS: We develop a standardized data analysis process to support cohort study with a focus on a particular disease. We use an interactive divide-and-conquer approach to classify patients into relatively uniform within each group. It is a repetitive process enabling the user to divide the data into homogeneous subsets that can be visually examined, compared, and refined. The final visualization was driven by the transformed data, and user feedback direct to the corresponding operators which completed the repetitive process. The output results are shown in a Sankey diagram-style timeline, which is a particular kind of flow diagram for showing factors’ states and transitions over time. RESULTS: This paper presented a visually rich, interactive web-based application, which could enable researchers to study any cohorts over time by using EMR data. The resulting visualizations help uncover hidden information in the data, compare differences between patient groups, determine critical factors that influence a particular disease, and help direct further analyses. We introduced and demonstrated this tool by using EMRs of 14,567 Chronic Kidney Disease (CKD) patients. CONCLUSIONS: We developed a visual mining system to support exploratory data analysis of multi-dimensional categorical EMR data. By using CKD as a model of disease, it was assembled by automated correlational analysis and human-curated visual evaluation. The visualization methods such as Sankey diagram can reveal useful knowledge about the particular disease cohort and the trajectories of the disease over time.
format Online
Article
Text
id pubmed-4643519
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46435192015-11-14 A richly interactive exploratory data analysis and visualization tool using electronic medical records Huang, Chih-Wei Lu, Richard Iqbal, Usman Lin, Shen-Hsien Nguyen, Phung Anh (Alex) Yang, Hsuan-Chia Wang, Chun-Fu Li, Jianping Ma, Kwan-Liu Li, Yu-Chuan (Jack) Jian, Wen-Shan BMC Med Inform Decis Mak Research Article BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasingly important problem to solve. METHODS: We develop a standardized data analysis process to support cohort study with a focus on a particular disease. We use an interactive divide-and-conquer approach to classify patients into relatively uniform within each group. It is a repetitive process enabling the user to divide the data into homogeneous subsets that can be visually examined, compared, and refined. The final visualization was driven by the transformed data, and user feedback direct to the corresponding operators which completed the repetitive process. The output results are shown in a Sankey diagram-style timeline, which is a particular kind of flow diagram for showing factors’ states and transitions over time. RESULTS: This paper presented a visually rich, interactive web-based application, which could enable researchers to study any cohorts over time by using EMR data. The resulting visualizations help uncover hidden information in the data, compare differences between patient groups, determine critical factors that influence a particular disease, and help direct further analyses. We introduced and demonstrated this tool by using EMRs of 14,567 Chronic Kidney Disease (CKD) patients. CONCLUSIONS: We developed a visual mining system to support exploratory data analysis of multi-dimensional categorical EMR data. By using CKD as a model of disease, it was assembled by automated correlational analysis and human-curated visual evaluation. The visualization methods such as Sankey diagram can reveal useful knowledge about the particular disease cohort and the trajectories of the disease over time. BioMed Central 2015-11-12 /pmc/articles/PMC4643519/ /pubmed/26563282 http://dx.doi.org/10.1186/s12911-015-0218-7 Text en © Huang et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Huang, Chih-Wei
Lu, Richard
Iqbal, Usman
Lin, Shen-Hsien
Nguyen, Phung Anh (Alex)
Yang, Hsuan-Chia
Wang, Chun-Fu
Li, Jianping
Ma, Kwan-Liu
Li, Yu-Chuan (Jack)
Jian, Wen-Shan
A richly interactive exploratory data analysis and visualization tool using electronic medical records
title A richly interactive exploratory data analysis and visualization tool using electronic medical records
title_full A richly interactive exploratory data analysis and visualization tool using electronic medical records
title_fullStr A richly interactive exploratory data analysis and visualization tool using electronic medical records
title_full_unstemmed A richly interactive exploratory data analysis and visualization tool using electronic medical records
title_short A richly interactive exploratory data analysis and visualization tool using electronic medical records
title_sort richly interactive exploratory data analysis and visualization tool using electronic medical records
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4643519/
https://www.ncbi.nlm.nih.gov/pubmed/26563282
http://dx.doi.org/10.1186/s12911-015-0218-7
work_keys_str_mv AT huangchihwei arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT lurichard arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT iqbalusman arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT linshenhsien arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT nguyenphunganhalex arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT yanghsuanchia arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT wangchunfu arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT lijianping arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT makwanliu arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT liyuchuanjack arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT jianwenshan arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT huangchihwei richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT lurichard richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT iqbalusman richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT linshenhsien richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT nguyenphunganhalex richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT yanghsuanchia richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT wangchunfu richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT lijianping richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT makwanliu richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT liyuchuanjack richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords
AT jianwenshan richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords