Cargando…
Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records
OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR). MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched c...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952009/ https://www.ncbi.nlm.nih.gov/pubmed/31984368 http://dx.doi.org/10.1093/jamiaopen/ooz031 |
_version_ | 1783486372831559680 |
---|---|
author | Lin, Huaiying Rong, Ruichen Gao, Xiang Revanna, Kashi Zhao, Michael Bajic, Petar Jin, David Hu, Chengjun Dong, Qunfeng |
author_facet | Lin, Huaiying Rong, Ruichen Gao, Xiang Revanna, Kashi Zhao, Michael Bajic, Petar Jin, David Hu, Chengjun Dong, Qunfeng |
author_sort | Lin, Huaiying |
collection | PubMed |
description | OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR). MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched cohorts from longitudinal medical records to assess for significant temporal correlations between diseases using two independent methodologies: Cox proportional hazards regression and random forest survival analysis. This optimizable package has the potential to control for relevant confounding factors such as age, gender, and other demographic and medical characteristics. Output is presented as a DCN which may be analyzed using a JavaScript-based interactive visualization tool for users to explore statistically significant correlations between disease states of interest using graph-theory-based network topology. RESULTS: We have applied this package to a longitudinal dataset at Loyola University Chicago Medical Center with 654 084 distinct initial diagnoses of 51 conditions in 175 539 patients. Over 90% of disease correlations identified are supported by literature review. DCN is available for download at https://github.com/qunfengdong/DCN. CONCLUSIONS: DCN allows screening of EMR data to identify potential relationships between chronic disease states. This data may then be used to formulate novel research hypotheses for further characterization of these relationships. |
format | Online Article Text |
id | pubmed-6952009 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-69520092020-01-24 Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records Lin, Huaiying Rong, Ruichen Gao, Xiang Revanna, Kashi Zhao, Michael Bajic, Petar Jin, David Hu, Chengjun Dong, Qunfeng JAMIA Open Research and Applications OBJECTIVE: To provide an open-source software package for determining temporal correlations between disease states using longitudinal electronic medical records (EMR). MATERIALS AND METHODS: We have developed an R-based package, Disease Correlation Network (DCN), which builds retrospective matched cohorts from longitudinal medical records to assess for significant temporal correlations between diseases using two independent methodologies: Cox proportional hazards regression and random forest survival analysis. This optimizable package has the potential to control for relevant confounding factors such as age, gender, and other demographic and medical characteristics. Output is presented as a DCN which may be analyzed using a JavaScript-based interactive visualization tool for users to explore statistically significant correlations between disease states of interest using graph-theory-based network topology. RESULTS: We have applied this package to a longitudinal dataset at Loyola University Chicago Medical Center with 654 084 distinct initial diagnoses of 51 conditions in 175 539 patients. Over 90% of disease correlations identified are supported by literature review. DCN is available for download at https://github.com/qunfengdong/DCN. CONCLUSIONS: DCN allows screening of EMR data to identify potential relationships between chronic disease states. This data may then be used to formulate novel research hypotheses for further characterization of these relationships. Oxford University Press 2019-08-23 /pmc/articles/PMC6952009/ /pubmed/31984368 http://dx.doi.org/10.1093/jamiaopen/ooz031 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research and Applications Lin, Huaiying Rong, Ruichen Gao, Xiang Revanna, Kashi Zhao, Michael Bajic, Petar Jin, David Hu, Chengjun Dong, Qunfeng Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records |
title | Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records |
title_full | Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records |
title_fullStr | Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records |
title_full_unstemmed | Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records |
title_short | Disease correlation network: a computational package for identifying temporal correlations between disease states from Large-Scale longitudinal medical records |
title_sort | disease correlation network: a computational package for identifying temporal correlations between disease states from large-scale longitudinal medical records |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6952009/ https://www.ncbi.nlm.nih.gov/pubmed/31984368 http://dx.doi.org/10.1093/jamiaopen/ooz031 |
work_keys_str_mv | AT linhuaiying diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT rongruichen diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT gaoxiang diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT revannakashi diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT zhaomichael diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT bajicpetar diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT jindavid diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT huchengjun diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords AT dongqunfeng diseasecorrelationnetworkacomputationalpackageforidentifyingtemporalcorrelationsbetweendiseasestatesfromlargescalelongitudinalmedicalrecords |