Cargando…
ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sampl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417819/ https://www.ncbi.nlm.nih.gov/pubmed/30864308 |
_version_ | 1783403628160090112 |
---|---|
author | Duan, Rui Boland, Mary Regina Moore, Jason H. Chen, Yong |
author_facet | Duan, Rui Boland, Mary Regina Moore, Jason H. Chen, Yong |
author_sort | Duan, Rui |
collection | PubMed |
description | Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sample size in a more general population which potentially reduces clinical bias and improves estimation and prediction accuracy. To overcome the barrier of patient-level data sharing, distributed algorithms are developed to conduct statistical analyses across multiple sites through sharing only aggregated information. The current distributed algorithm often requires iterative information evaluation and transferring across sites, which can potentially lead to a high communication cost in practical settings. In this study, we propose a privacy-preserving and communication-efficient distributed algorithm for logistic regression without requiring iterative communications across sites. Our simulation study showed our algorithm reached comparative accuracy comparing to the oracle estimator where data are pooled together. We applied our algorithm to an EHR data from the University of Pennsylvania health system to evaluate the risks of fetal loss due to various medication exposures. |
format | Online Article Text |
id | pubmed-6417819 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
record_format | MEDLINE/PubMed |
spelling | pubmed-64178192019-03-14 ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites Duan, Rui Boland, Mary Regina Moore, Jason H. Chen, Yong Pac Symp Biocomput Article Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sample size in a more general population which potentially reduces clinical bias and improves estimation and prediction accuracy. To overcome the barrier of patient-level data sharing, distributed algorithms are developed to conduct statistical analyses across multiple sites through sharing only aggregated information. The current distributed algorithm often requires iterative information evaluation and transferring across sites, which can potentially lead to a high communication cost in practical settings. In this study, we propose a privacy-preserving and communication-efficient distributed algorithm for logistic regression without requiring iterative communications across sites. Our simulation study showed our algorithm reached comparative accuracy comparing to the oracle estimator where data are pooled together. We applied our algorithm to an EHR data from the University of Pennsylvania health system to evaluate the risks of fetal loss due to various medication exposures. 2019 /pmc/articles/PMC6417819/ /pubmed/30864308 Text en http://creativecommons.org/licenses/by-nc/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License. |
spellingShingle | Article Duan, Rui Boland, Mary Regina Moore, Jason H. Chen, Yong ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
title | ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
title_full | ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
title_fullStr | ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
title_full_unstemmed | ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
title_short | ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
title_sort | odal: a one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417819/ https://www.ncbi.nlm.nih.gov/pubmed/30864308 |
work_keys_str_mv | AT duanrui odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites AT bolandmaryregina odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites AT moorejasonh odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites AT chenyong odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites |