Cargando…

Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma

OBJECTIVE: This study introduces a temporal condition pattern mining methodology to address the sparse nature of coded condition concept utilization in electronic health record data. As a validation study, we applied this method to reveal condition patterns surrounding an initial diagnosis of pediat...

Descripción completa

Detalles Bibliográficos
Autores principales: Campbell, Elizabeth A, Bass, Ellen J, Masino, Aaron J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7075539/
https://www.ncbi.nlm.nih.gov/pubmed/32049282
http://dx.doi.org/10.1093/jamia/ocaa005
_version_ 1783507053738721280
author Campbell, Elizabeth A
Bass, Ellen J
Masino, Aaron J
author_facet Campbell, Elizabeth A
Bass, Ellen J
Masino, Aaron J
author_sort Campbell, Elizabeth A
collection PubMed
description OBJECTIVE: This study introduces a temporal condition pattern mining methodology to address the sparse nature of coded condition concept utilization in electronic health record data. As a validation study, we applied this method to reveal condition patterns surrounding an initial diagnosis of pediatric asthma. MATERIALS AND METHODS: The SPADE (Sequential PAttern Discovery using Equivalence classes) algorithm was used to identify common temporal condition patterns surrounding the initial diagnosis of pediatric asthma in a study population of 71 824 patients from the Children’s Hospital of Philadelphia. SPADE was applied to a dataset with diagnoses coded using International Classification of Diseases (ICD) concepts and separately to a dataset with the ICD codes mapped to their corresponding expanded diagnostic clusters (EDCs). Common temporal condition patterns surrounding the initial diagnosis of pediatric asthma ascertained by SPADE from both the ICD and EDC datasets were compared. RESULTS: SPADE identified 36 unique diagnoses in the mapped EDC dataset, whereas only 19 were recognized in the ICD dataset. Temporal trends in condition diagnoses ascertained from the EDC data were not discoverable in the ICD dataset. DISCUSSION: Mining frequent temporal condition patterns from large electronic health record datasets may reveal previously unknown associations between diagnoses that could inform future research into causation or other relationships. Mapping sparsely coded medical concepts into homogenous groups was essential to discovering potentially useful information from our dataset. CONCLUSIONS: We expect that the presented methodology is applicable to the study of diagnostic trajectories for other clinical conditions and can be extended to study temporal patterns of other coded medical concepts such as medications and procedures.
format Online
Article
Text
id pubmed-7075539
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-70755392020-03-18 Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma Campbell, Elizabeth A Bass, Ellen J Masino, Aaron J J Am Med Inform Assoc Research and Applications OBJECTIVE: This study introduces a temporal condition pattern mining methodology to address the sparse nature of coded condition concept utilization in electronic health record data. As a validation study, we applied this method to reveal condition patterns surrounding an initial diagnosis of pediatric asthma. MATERIALS AND METHODS: The SPADE (Sequential PAttern Discovery using Equivalence classes) algorithm was used to identify common temporal condition patterns surrounding the initial diagnosis of pediatric asthma in a study population of 71 824 patients from the Children’s Hospital of Philadelphia. SPADE was applied to a dataset with diagnoses coded using International Classification of Diseases (ICD) concepts and separately to a dataset with the ICD codes mapped to their corresponding expanded diagnostic clusters (EDCs). Common temporal condition patterns surrounding the initial diagnosis of pediatric asthma ascertained by SPADE from both the ICD and EDC datasets were compared. RESULTS: SPADE identified 36 unique diagnoses in the mapped EDC dataset, whereas only 19 were recognized in the ICD dataset. Temporal trends in condition diagnoses ascertained from the EDC data were not discoverable in the ICD dataset. DISCUSSION: Mining frequent temporal condition patterns from large electronic health record datasets may reveal previously unknown associations between diagnoses that could inform future research into causation or other relationships. Mapping sparsely coded medical concepts into homogenous groups was essential to discovering potentially useful information from our dataset. CONCLUSIONS: We expect that the presented methodology is applicable to the study of diagnostic trajectories for other clinical conditions and can be extended to study temporal patterns of other coded medical concepts such as medications and procedures. Oxford University Press 2020-02-12 /pmc/articles/PMC7075539/ /pubmed/32049282 http://dx.doi.org/10.1093/jamia/ocaa005 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Campbell, Elizabeth A
Bass, Ellen J
Masino, Aaron J
Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma
title Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma
title_full Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma
title_fullStr Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma
title_full_unstemmed Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma
title_short Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma
title_sort temporal condition pattern mining in large, sparse electronic health record data: a case study in characterizing pediatric asthma
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7075539/
https://www.ncbi.nlm.nih.gov/pubmed/32049282
http://dx.doi.org/10.1093/jamia/ocaa005
work_keys_str_mv AT campbellelizabetha temporalconditionpatternmininginlargesparseelectronichealthrecorddataacasestudyincharacterizingpediatricasthma
AT bassellenj temporalconditionpatternmininginlargesparseelectronichealthrecorddataacasestudyincharacterizingpediatricasthma
AT masinoaaronj temporalconditionpatternmininginlargesparseelectronichealthrecorddataacasestudyincharacterizingpediatricasthma