Cargando…

Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study

BACKGROUND: One of the major challenges in the health care sector is that approximately 80% of generated data remains unstructured and unused. Since it is difficult to handle unstructured data from electronic medical record systems, it tends to be neglected for analyses in most hospitals and medical...

Descripción completa

Detalles Bibliográficos
Autores principales: Sai Prashanthi, Gumpili, Deva, Ayush, Vadapalli, Ranganath, Das, Anthony Vipin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7775202/
https://www.ncbi.nlm.nih.gov/pubmed/33331823
http://dx.doi.org/10.2196/24490
_version_ 1783630427730214912
author Sai Prashanthi, Gumpili
Deva, Ayush
Vadapalli, Ranganath
Das, Anthony Vipin
author_facet Sai Prashanthi, Gumpili
Deva, Ayush
Vadapalli, Ranganath
Das, Anthony Vipin
author_sort Sai Prashanthi, Gumpili
collection PubMed
description BACKGROUND: One of the major challenges in the health care sector is that approximately 80% of generated data remains unstructured and unused. Since it is difficult to handle unstructured data from electronic medical record systems, it tends to be neglected for analyses in most hospitals and medical centers. Therefore, there is a need to analyze unstructured big data in health care systems so that we can optimally utilize and unearth all unexploited information from it. OBJECTIVE: In this study, we aimed to extract a list of diseases and associated keywords along with the corresponding time durations from an indigenously developed electronic medical record system and describe the possibility of analytics from the acquired datasets. METHODS: We propose a novel, finite-state machine to sequentially detect and cluster disease names from patients’ medical history. We defined 3 states in the finite-state machine and transition matrix, which depend on the identified keyword. In addition, we also defined a state-change action matrix, which is essentially an action associated with each transition. The dataset used in this study was obtained from an indigenously developed electronic medical record system called eyeSmart that was implemented across a large, multitier ophthalmology network in India. The dataset included patients’ past medical history and contained records of 10,000 distinct patients. RESULTS: We extracted disease names and associated keywords by using the finite-state machine with an accuracy of 95%, sensitivity of 94.9%, and positive predictive value of 100%. For the extraction of the duration of disease, the machine’s accuracy was 93%, sensitivity was 92.9%, and the positive predictive value was 100%. CONCLUSIONS: We demonstrated that the finite-state machine we developed in this study can be used to accurately identify disease names, associated keywords, and time durations from a large cohort of patient records obtained using an electronic medical record system.
format Online
Article
Text
id pubmed-7775202
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-77752022021-01-15 Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study Sai Prashanthi, Gumpili Deva, Ayush Vadapalli, Ranganath Das, Anthony Vipin JMIR Form Res Original Paper BACKGROUND: One of the major challenges in the health care sector is that approximately 80% of generated data remains unstructured and unused. Since it is difficult to handle unstructured data from electronic medical record systems, it tends to be neglected for analyses in most hospitals and medical centers. Therefore, there is a need to analyze unstructured big data in health care systems so that we can optimally utilize and unearth all unexploited information from it. OBJECTIVE: In this study, we aimed to extract a list of diseases and associated keywords along with the corresponding time durations from an indigenously developed electronic medical record system and describe the possibility of analytics from the acquired datasets. METHODS: We propose a novel, finite-state machine to sequentially detect and cluster disease names from patients’ medical history. We defined 3 states in the finite-state machine and transition matrix, which depend on the identified keyword. In addition, we also defined a state-change action matrix, which is essentially an action associated with each transition. The dataset used in this study was obtained from an indigenously developed electronic medical record system called eyeSmart that was implemented across a large, multitier ophthalmology network in India. The dataset included patients’ past medical history and contained records of 10,000 distinct patients. RESULTS: We extracted disease names and associated keywords by using the finite-state machine with an accuracy of 95%, sensitivity of 94.9%, and positive predictive value of 100%. For the extraction of the duration of disease, the machine’s accuracy was 93%, sensitivity was 92.9%, and the positive predictive value was 100%. CONCLUSIONS: We demonstrated that the finite-state machine we developed in this study can be used to accurately identify disease names, associated keywords, and time durations from a large cohort of patient records obtained using an electronic medical record system. JMIR Publications 2020-12-17 /pmc/articles/PMC7775202/ /pubmed/33331823 http://dx.doi.org/10.2196/24490 Text en ©Gumpili Sai Prashanthi, Ayush Deva, Ranganath Vadapalli, Anthony Vipin Das. Originally published in JMIR Formative Research (http://formative.jmir.org), 17.12.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on http://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Sai Prashanthi, Gumpili
Deva, Ayush
Vadapalli, Ranganath
Das, Anthony Vipin
Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study
title Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study
title_full Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study
title_fullStr Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study
title_full_unstemmed Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study
title_short Automated Categorization of Systemic Disease and Duration From Electronic Medical Record System Data Using Finite-State Machine Modeling: Prospective Validation Study
title_sort automated categorization of systemic disease and duration from electronic medical record system data using finite-state machine modeling: prospective validation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7775202/
https://www.ncbi.nlm.nih.gov/pubmed/33331823
http://dx.doi.org/10.2196/24490
work_keys_str_mv AT saiprashanthigumpili automatedcategorizationofsystemicdiseaseanddurationfromelectronicmedicalrecordsystemdatausingfinitestatemachinemodelingprospectivevalidationstudy
AT devaayush automatedcategorizationofsystemicdiseaseanddurationfromelectronicmedicalrecordsystemdatausingfinitestatemachinemodelingprospectivevalidationstudy
AT vadapalliranganath automatedcategorizationofsystemicdiseaseanddurationfromelectronicmedicalrecordsystemdatausingfinitestatemachinemodelingprospectivevalidationstudy
AT dasanthonyvipin automatedcategorizationofsystemicdiseaseanddurationfromelectronicmedicalrecordsystemdatausingfinitestatemachinemodelingprospectivevalidationstudy