Cargando…

Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data

BACKGROUND: Electronic health records (EHRs) are potentially important components in addressing pediatric obesity in clinical settings and at the population level. This work aims to identify temporal condition patterns surrounding obesity incidence in a large pediatric population that may inform cli...

Descripción completa

Detalles Bibliográficos
Autores principales: Campbell, Elizabeth A., Qian, Ting, Miller, Jeffrey M., Bass, Ellen J., Masino, Aaron J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7381422/
https://www.ncbi.nlm.nih.gov/pubmed/32494036
http://dx.doi.org/10.1038/s41366-020-0614-7
_version_ 1783563043653812224
author Campbell, Elizabeth A.
Qian, Ting
Miller, Jeffrey M.
Bass, Ellen J.
Masino, Aaron J.
author_facet Campbell, Elizabeth A.
Qian, Ting
Miller, Jeffrey M.
Bass, Ellen J.
Masino, Aaron J.
author_sort Campbell, Elizabeth A.
collection PubMed
description BACKGROUND: Electronic health records (EHRs) are potentially important components in addressing pediatric obesity in clinical settings and at the population level. This work aims to identify temporal condition patterns surrounding obesity incidence in a large pediatric population that may inform clinical care and childhood obesity policy and prevention efforts. METHODS: EHR data from healthcare visits with an initial record of obesity incidence (index visit) from 2009 through 2016 at the Children’s Hospital of Philadelphia, and visits immediately before (pre-index) and after (post-index), were compared with a matched control population of patients with a healthy weight to characterize the prevalence of common diagnoses and condition trajectories. The study population consisted of 49,694 patients with pediatric obesity and their corresponding matched controls. The SPADE algorithm was used to identify common temporal condition patterns in the case population. McNemar’s test was used to assess the statistical significance of pattern prevalence differences between the case and control populations. RESULTS: SPADE identified 163 condition patterns that were present in at least 1% of cases; 80 were significantly more common among cases and 45 were significantly more common among controls (p < 0.05). Asthma and allergic rhinitis were strongly associated with childhood obesity incidence, particularly during the pre-index and index visits. Seven conditions were commonly diagnosed for cases exclusively during pre-index visits, including ear, nose, and throat disorders and gastroenteritis. CONCLUSIONS: The novel application of SPADE on a large retrospective dataset revealed temporally dependent condition associations with obesity incidence. Allergic rhinitis and asthma had a particularly high prevalence during pre-index visits. These conditions, along with those exclusively observed during pre-index visits, may represent signals of future obesity. While causation cannot be inferred from these associations, the temporal condition patterns identified here represent hypotheses that can be investigated to determine causal relationships in future obesity research.
format Online
Article
Text
id pubmed-7381422
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73814222020-08-04 Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data Campbell, Elizabeth A. Qian, Ting Miller, Jeffrey M. Bass, Ellen J. Masino, Aaron J. Int J Obes (Lond) Article BACKGROUND: Electronic health records (EHRs) are potentially important components in addressing pediatric obesity in clinical settings and at the population level. This work aims to identify temporal condition patterns surrounding obesity incidence in a large pediatric population that may inform clinical care and childhood obesity policy and prevention efforts. METHODS: EHR data from healthcare visits with an initial record of obesity incidence (index visit) from 2009 through 2016 at the Children’s Hospital of Philadelphia, and visits immediately before (pre-index) and after (post-index), were compared with a matched control population of patients with a healthy weight to characterize the prevalence of common diagnoses and condition trajectories. The study population consisted of 49,694 patients with pediatric obesity and their corresponding matched controls. The SPADE algorithm was used to identify common temporal condition patterns in the case population. McNemar’s test was used to assess the statistical significance of pattern prevalence differences between the case and control populations. RESULTS: SPADE identified 163 condition patterns that were present in at least 1% of cases; 80 were significantly more common among cases and 45 were significantly more common among controls (p < 0.05). Asthma and allergic rhinitis were strongly associated with childhood obesity incidence, particularly during the pre-index and index visits. Seven conditions were commonly diagnosed for cases exclusively during pre-index visits, including ear, nose, and throat disorders and gastroenteritis. CONCLUSIONS: The novel application of SPADE on a large retrospective dataset revealed temporally dependent condition associations with obesity incidence. Allergic rhinitis and asthma had a particularly high prevalence during pre-index visits. These conditions, along with those exclusively observed during pre-index visits, may represent signals of future obesity. While causation cannot be inferred from these associations, the temporal condition patterns identified here represent hypotheses that can be investigated to determine causal relationships in future obesity research. Nature Publishing Group UK 2020-06-03 2020 /pmc/articles/PMC7381422/ /pubmed/32494036 http://dx.doi.org/10.1038/s41366-020-0614-7 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Campbell, Elizabeth A.
Qian, Ting
Miller, Jeffrey M.
Bass, Ellen J.
Masino, Aaron J.
Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
title Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
title_full Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
title_fullStr Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
title_full_unstemmed Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
title_short Identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
title_sort identification of temporal condition patterns associated with pediatric obesity incidence using sequence mining and big data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7381422/
https://www.ncbi.nlm.nih.gov/pubmed/32494036
http://dx.doi.org/10.1038/s41366-020-0614-7
work_keys_str_mv AT campbellelizabetha identificationoftemporalconditionpatternsassociatedwithpediatricobesityincidenceusingsequenceminingandbigdata
AT qianting identificationoftemporalconditionpatternsassociatedwithpediatricobesityincidenceusingsequenceminingandbigdata
AT millerjeffreym identificationoftemporalconditionpatternsassociatedwithpediatricobesityincidenceusingsequenceminingandbigdata
AT bassellenj identificationoftemporalconditionpatternsassociatedwithpediatricobesityincidenceusingsequenceminingandbigdata
AT masinoaaronj identificationoftemporalconditionpatternsassociatedwithpediatricobesityincidenceusingsequenceminingandbigdata