Cargando…

ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites

Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sampl...

Descripción completa

Detalles Bibliográficos
Autores principales: Duan, Rui, Boland, Mary Regina, Moore, Jason H., Chen, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417819/
https://www.ncbi.nlm.nih.gov/pubmed/30864308
_version_ 1783403628160090112
author Duan, Rui
Boland, Mary Regina
Moore, Jason H.
Chen, Yong
author_facet Duan, Rui
Boland, Mary Regina
Moore, Jason H.
Chen, Yong
author_sort Duan, Rui
collection PubMed
description Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sample size in a more general population which potentially reduces clinical bias and improves estimation and prediction accuracy. To overcome the barrier of patient-level data sharing, distributed algorithms are developed to conduct statistical analyses across multiple sites through sharing only aggregated information. The current distributed algorithm often requires iterative information evaluation and transferring across sites, which can potentially lead to a high communication cost in practical settings. In this study, we propose a privacy-preserving and communication-efficient distributed algorithm for logistic regression without requiring iterative communications across sites. Our simulation study showed our algorithm reached comparative accuracy comparing to the oracle estimator where data are pooled together. We applied our algorithm to an EHR data from the University of Pennsylvania health system to evaluate the risks of fetal loss due to various medication exposures.
format Online
Article
Text
id pubmed-6417819
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-64178192019-03-14 ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites Duan, Rui Boland, Mary Regina Moore, Jason H. Chen, Yong Pac Symp Biocomput Article Electronic Health Records (EHR) contain extensive information on various health outcomes and risk factors, and therefore have been broadly used in healthcare research. Integrating EHR data from multiple clinical sites can accelerate knowledge discovery and risk prediction by providing a larger sample size in a more general population which potentially reduces clinical bias and improves estimation and prediction accuracy. To overcome the barrier of patient-level data sharing, distributed algorithms are developed to conduct statistical analyses across multiple sites through sharing only aggregated information. The current distributed algorithm often requires iterative information evaluation and transferring across sites, which can potentially lead to a high communication cost in practical settings. In this study, we propose a privacy-preserving and communication-efficient distributed algorithm for logistic regression without requiring iterative communications across sites. Our simulation study showed our algorithm reached comparative accuracy comparing to the oracle estimator where data are pooled together. We applied our algorithm to an EHR data from the University of Pennsylvania health system to evaluate the risks of fetal loss due to various medication exposures. 2019 /pmc/articles/PMC6417819/ /pubmed/30864308 Text en http://creativecommons.org/licenses/by-nc/4.0/ Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Duan, Rui
Boland, Mary Regina
Moore, Jason H.
Chen, Yong
ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
title ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
title_full ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
title_fullStr ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
title_full_unstemmed ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
title_short ODAL: A one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
title_sort odal: a one-shot distributed algorithm to perform logistic regressions on electronic health records data from multiple clinical sites
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417819/
https://www.ncbi.nlm.nih.gov/pubmed/30864308
work_keys_str_mv AT duanrui odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites
AT bolandmaryregina odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites
AT moorejasonh odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites
AT chenyong odalaoneshotdistributedalgorithmtoperformlogisticregressionsonelectronichealthrecordsdatafrommultipleclinicalsites