Cargando…

Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records

OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR). MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched c...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Huaiying, Rong, Ruichen, Gao, Xiang, Revanna, Kashi, Zhao, Michael, Bajic, Petar, Jin, David, Hu, Chengjun, Dong, Qunfeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952009/
https://www.ncbi.nlm.nih.gov/pubmed/31984368
http://dx.doi.org/10.1093/jamiaopen/ooz031
_version_ 1783486372831559680
author Lin, Huaiying
Rong, Ruichen
Gao, Xiang
Revanna, Kashi
Zhao, Michael
Bajic, Petar
Jin, David
Hu, Chengjun
Dong, Qunfeng
author_facet Lin, Huaiying
Rong, Ruichen
Gao, Xiang
Revanna, Kashi
Zhao, Michael
Bajic, Petar
Jin, David
Hu, Chengjun
Dong, Qunfeng
author_sort Lin, Huaiying
collection PubMed
description OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR). MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched cohorts from longitudinal medical records to assess for significant temporal correlations between diseases using two independent methodologies: Cox proportional hazards regression and random forest survival analysis. This optimizable package has the potential to control for relevant confounding factors such as age, gender, and other demographic and medical characteristics. Output is presented as a DCN which may be analyzed using a JavaScript-based interactive visualization tool for users to explore statistically significant correlations between disease states of interest using graph-theory-based network topology. RESULTS: We have applied this package to a longitudinal dataset at Loyola University Chicago Medical Center with 654 084 distinct initial diagnoses of 51 conditions in 175 539 patients. Over 90% of disease correlations identified are supported by literature review. DCN is available for download at https://github.com/qunfengdong/DCN. CONCLUSIONS: DCN allows screening of EMR data to identify potential relationships between chronic disease states. This data may then be used to formulate novel research hypotheses for further characterization of these relationships.
format Online
Article
Text
id pubmed-6952009
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-69520092020-01-24 Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records Lin, Huaiying Rong, Ruichen Gao, Xiang Revanna, Kashi Zhao, Michael Bajic, Petar Jin, David Hu, Chengjun Dong, Qunfeng JAMIA Open Research and Applications OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR). MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched cohorts from longitudinal medical records to assess for significant temporal correlations between diseases using two independent methodologies: Cox proportional hazards regression and random forest survival analysis. This optimizable package has the potential to control for relevant confounding factors such as age, gender, and other demographic and medical characteristics. Output is presented as a DCN which may be analyzed using a JavaScript-based interactive visualization tool for users to explore statistically significant correlations between disease states of interest using graph-theory-based network topology. RESULTS: We have applied this package to a longitudinal dataset at Loyola University Chicago Medical Center with 654 084 distinct initial diagnoses of 51 conditions in 175 539 patients. Over 90% of disease correlations identified are supported by literature review. DCN is available for download at https://github.com/qunfengdong/DCN. CONCLUSIONS: DCN allows screening of EMR data to identify potential relationships between chronic disease states. This data may then be used to formulate novel research hypotheses for further characterization of these relationships. Oxford University Press 2019-08-23 /pmc/articles/PMC6952009/ /pubmed/31984368 http://dx.doi.org/10.1093/jamiaopen/ooz031 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Lin, Huaiying
Rong, Ruichen
Gao, Xiang
Revanna, Kashi
Zhao, Michael
Bajic, Petar
Jin, David
Hu, Chengjun
Dong, Qunfeng
Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
title Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
title_full Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
title_fullStr Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
title_full_unstemmed Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
title_short Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
title_sort disease correlation network: a computational package for identifying temporal correlations between disease states from large-scale longitudinal medical records
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952009/
https://www.ncbi.nlm.nih.gov/pubmed/31984368
http://dx.doi.org/10.1093/jamiaopen/ooz031
work_keys_str_mv AT linhuaiying diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT rongruichen diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT gaoxiang diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT revannakashi diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT zhaomichael diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT bajicpetar diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT jindavid diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT huchengjun diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords
AT dongqunfeng diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords