Cargando…
A richly interactive exploratory data analysis and visualization tool using electronic medical records
BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasin...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4643519/ https://www.ncbi.nlm.nih.gov/pubmed/26563282 http://dx.doi.org/10.1186/s12911-015-0218-7 |
_version_ | 1782400528522674176 |
---|---|
author | Huang, Chih-Wei Lu, Richard Iqbal, Usman Lin, Shen-Hsien Nguyen, Phung Anh (Alex) Yang, Hsuan-Chia Wang, Chun-Fu Li, Jianping Ma, Kwan-Liu Li, Yu-Chuan (Jack) Jian, Wen-Shan |
author_facet | Huang, Chih-Wei Lu, Richard Iqbal, Usman Lin, Shen-Hsien Nguyen, Phung Anh (Alex) Yang, Hsuan-Chia Wang, Chun-Fu Li, Jianping Ma, Kwan-Liu Li, Yu-Chuan (Jack) Jian, Wen-Shan |
author_sort | Huang, Chih-Wei |
collection | PubMed |
description | BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasingly important problem to solve. METHODS: We develop a standardized data analysis process to support cohort study with a focus on a particular disease. We use an interactive divide-and-conquer approach to classify patients into relatively uniform within each group. It is a repetitive process enabling the user to divide the data into homogeneous subsets that can be visually examined, compared, and refined. The final visualization was driven by the transformed data, and user feedback direct to the corresponding operators which completed the repetitive process. The output results are shown in a Sankey diagram-style timeline, which is a particular kind of flow diagram for showing factors’ states and transitions over time. RESULTS: This paper presented a visually rich, interactive web-based application, which could enable researchers to study any cohorts over time by using EMR data. The resulting visualizations help uncover hidden information in the data, compare differences between patient groups, determine critical factors that influence a particular disease, and help direct further analyses. We introduced and demonstrated this tool by using EMRs of 14,567 Chronic Kidney Disease (CKD) patients. CONCLUSIONS: We developed a visual mining system to support exploratory data analysis of multi-dimensional categorical EMR data. By using CKD as a model of disease, it was assembled by automated correlational analysis and human-curated visual evaluation. The visualization methods such as Sankey diagram can reveal useful knowledge about the particular disease cohort and the trajectories of the disease over time. |
format | Online Article Text |
id | pubmed-4643519 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46435192015-11-14 A richly interactive exploratory data analysis and visualization tool using electronic medical records Huang, Chih-Wei Lu, Richard Iqbal, Usman Lin, Shen-Hsien Nguyen, Phung Anh (Alex) Yang, Hsuan-Chia Wang, Chun-Fu Li, Jianping Ma, Kwan-Liu Li, Yu-Chuan (Jack) Jian, Wen-Shan BMC Med Inform Decis Mak Research Article BACKGROUND: Electronic medical records (EMRs) contain vast amounts of data that is of great interest to physicians, clinical researchers, and medial policy makers. As the size, complexity, and accessibility of EMRs grow, the ability to extract meaningful information from them has become an increasingly important problem to solve. METHODS: We develop a standardized data analysis process to support cohort study with a focus on a particular disease. We use an interactive divide-and-conquer approach to classify patients into relatively uniform within each group. It is a repetitive process enabling the user to divide the data into homogeneous subsets that can be visually examined, compared, and refined. The final visualization was driven by the transformed data, and user feedback direct to the corresponding operators which completed the repetitive process. The output results are shown in a Sankey diagram-style timeline, which is a particular kind of flow diagram for showing factors’ states and transitions over time. RESULTS: This paper presented a visually rich, interactive web-based application, which could enable researchers to study any cohorts over time by using EMR data. The resulting visualizations help uncover hidden information in the data, compare differences between patient groups, determine critical factors that influence a particular disease, and help direct further analyses. We introduced and demonstrated this tool by using EMRs of 14,567 Chronic Kidney Disease (CKD) patients. CONCLUSIONS: We developed a visual mining system to support exploratory data analysis of multi-dimensional categorical EMR data. By using CKD as a model of disease, it was assembled by automated correlational analysis and human-curated visual evaluation. The visualization methods such as Sankey diagram can reveal useful knowledge about the particular disease cohort and the trajectories of the disease over time. BioMed Central 2015-11-12 /pmc/articles/PMC4643519/ /pubmed/26563282 http://dx.doi.org/10.1186/s12911-015-0218-7 Text en © Huang et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Huang, Chih-Wei Lu, Richard Iqbal, Usman Lin, Shen-Hsien Nguyen, Phung Anh (Alex) Yang, Hsuan-Chia Wang, Chun-Fu Li, Jianping Ma, Kwan-Liu Li, Yu-Chuan (Jack) Jian, Wen-Shan A richly interactive exploratory data analysis and visualization tool using electronic medical records |
title | A richly interactive exploratory data analysis and visualization tool using electronic medical records |
title_full | A richly interactive exploratory data analysis and visualization tool using electronic medical records |
title_fullStr | A richly interactive exploratory data analysis and visualization tool using electronic medical records |
title_full_unstemmed | A richly interactive exploratory data analysis and visualization tool using electronic medical records |
title_short | A richly interactive exploratory data analysis and visualization tool using electronic medical records |
title_sort | richly interactive exploratory data analysis and visualization tool using electronic medical records |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4643519/ https://www.ncbi.nlm.nih.gov/pubmed/26563282 http://dx.doi.org/10.1186/s12911-015-0218-7 |
work_keys_str_mv | AT huangchihwei arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT lurichard arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT iqbalusman arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT linshenhsien arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT nguyenphunganhalex arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT yanghsuanchia arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT wangchunfu arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT lijianping arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT makwanliu arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT liyuchuanjack arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT jianwenshan arichlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT huangchihwei richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT lurichard richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT iqbalusman richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT linshenhsien richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT nguyenphunganhalex richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT yanghsuanchia richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT wangchunfu richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT lijianping richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT makwanliu richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT liyuchuanjack richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords AT jianwenshan richlyinteractiveexploratorydataanalysisandvisualizationtoolusingelectronicmedicalrecords |