Cargando…

Privacy-protecting, reliable response data discovery using COVID-19 patient observations

OBJECTIVE: To utilize, in an individual and institutional privacy-preserving manner, electronic health record (EHR) data from 202 hospitals by analyzing answers to COVID-19-related questions and posting these answers online. MATERIALS AND METHODS: We developed a distributed, federated network of 12...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jihoon, Neumann, Larissa, Paul, Paulina, Day, Michele E, Aratow, Michael, Bell, Douglas S, Doctor, Jason N, Hinske, Ludwig C, Jiang, Xiaoqian, Kim, Katherine K, Matheny, Michael E, Meeker, Daniella, Pletcher, Mark J, Schilling, Lisa M, SooHoo, Spencer, Xu, Hua, Zheng, Kai, Ohno-Machado, Lucila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8194878/
https://www.ncbi.nlm.nih.gov/pubmed/34051088
http://dx.doi.org/10.1093/jamia/ocab054
_version_ 1783706448407035904
author Kim, Jihoon
Neumann, Larissa
Paul, Paulina
Day, Michele E
Aratow, Michael
Bell, Douglas S
Doctor, Jason N
Hinske, Ludwig C
Jiang, Xiaoqian
Kim, Katherine K
Matheny, Michael E
Meeker, Daniella
Pletcher, Mark J
Schilling, Lisa M
SooHoo, Spencer
Xu, Hua
Zheng, Kai
Ohno-Machado, Lucila
author_facet Kim, Jihoon
Neumann, Larissa
Paul, Paulina
Day, Michele E
Aratow, Michael
Bell, Douglas S
Doctor, Jason N
Hinske, Ludwig C
Jiang, Xiaoqian
Kim, Katherine K
Matheny, Michael E
Meeker, Daniella
Pletcher, Mark J
Schilling, Lisa M
SooHoo, Spencer
Xu, Hua
Zheng, Kai
Ohno-Machado, Lucila
author_sort Kim, Jihoon
collection PubMed
description OBJECTIVE: To utilize, in an individual and institutional privacy-preserving manner, electronic health record (EHR) data from 202 hospitals by analyzing answers to COVID-19-related questions and posting these answers online. MATERIALS AND METHODS: We developed a distributed, federated network of 12 health systems that harmonized their EHRs and submitted aggregate answers to consortia questions posted at https://www.covid19questions.org. Our consortium developed processes and implemented distributed algorithms to produce answers to a variety of questions. We were able to generate counts, descriptive statistics, and build a multivariate, iterative regression model without centralizing individual-level data. RESULTS: Our public website contains answers to various clinical questions, a web form for users to ask questions in natural language, and a list of items that are currently pending responses. The results show, for example, that patients who were taking angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers, within the year before admission, had lower unadjusted in-hospital mortality rates. We also showed that, when adjusted for, age, sex, and ethnicity were not significantly associated with mortality. We demonstrated that it is possible to answer questions about COVID-19 using EHR data from systems that have different policies and must follow various regulations, without moving data out of their health systems. DISCUSSION AND CONCLUSIONS: We present an alternative or a complement to centralized COVID-19 registries of EHR data. We can use multivariate distributed logistic regression on observations recorded in the process of care to generate results without transferring individual-level data outside the health systems.
format Online
Article
Text
id pubmed-8194878
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-81948782021-06-15 Privacy-protecting, reliable response data discovery using COVID-19 patient observations Kim, Jihoon Neumann, Larissa Paul, Paulina Day, Michele E Aratow, Michael Bell, Douglas S Doctor, Jason N Hinske, Ludwig C Jiang, Xiaoqian Kim, Katherine K Matheny, Michael E Meeker, Daniella Pletcher, Mark J Schilling, Lisa M SooHoo, Spencer Xu, Hua Zheng, Kai Ohno-Machado, Lucila J Am Med Inform Assoc Research and Applications OBJECTIVE: To utilize, in an individual and institutional privacy-preserving manner, electronic health record (EHR) data from 202 hospitals by analyzing answers to COVID-19-related questions and posting these answers online. MATERIALS AND METHODS: We developed a distributed, federated network of 12 health systems that harmonized their EHRs and submitted aggregate answers to consortia questions posted at https://www.covid19questions.org. Our consortium developed processes and implemented distributed algorithms to produce answers to a variety of questions. We were able to generate counts, descriptive statistics, and build a multivariate, iterative regression model without centralizing individual-level data. RESULTS: Our public website contains answers to various clinical questions, a web form for users to ask questions in natural language, and a list of items that are currently pending responses. The results show, for example, that patients who were taking angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers, within the year before admission, had lower unadjusted in-hospital mortality rates. We also showed that, when adjusted for, age, sex, and ethnicity were not significantly associated with mortality. We demonstrated that it is possible to answer questions about COVID-19 using EHR data from systems that have different policies and must follow various regulations, without moving data out of their health systems. DISCUSSION AND CONCLUSIONS: We present an alternative or a complement to centralized COVID-19 registries of EHR data. We can use multivariate distributed logistic regression on observations recorded in the process of care to generate results without transferring individual-level data outside the health systems. Oxford University Press 2021-05-29 /pmc/articles/PMC8194878/ /pubmed/34051088 http://dx.doi.org/10.1093/jamia/ocab054 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Kim, Jihoon
Neumann, Larissa
Paul, Paulina
Day, Michele E
Aratow, Michael
Bell, Douglas S
Doctor, Jason N
Hinske, Ludwig C
Jiang, Xiaoqian
Kim, Katherine K
Matheny, Michael E
Meeker, Daniella
Pletcher, Mark J
Schilling, Lisa M
SooHoo, Spencer
Xu, Hua
Zheng, Kai
Ohno-Machado, Lucila
Privacy-protecting, reliable response data discovery using COVID-19 patient observations
title Privacy-protecting, reliable response data discovery using COVID-19 patient observations
title_full Privacy-protecting, reliable response data discovery using COVID-19 patient observations
title_fullStr Privacy-protecting, reliable response data discovery using COVID-19 patient observations
title_full_unstemmed Privacy-protecting, reliable response data discovery using COVID-19 patient observations
title_short Privacy-protecting, reliable response data discovery using COVID-19 patient observations
title_sort privacy-protecting, reliable response data discovery using covid-19 patient observations
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8194878/
https://www.ncbi.nlm.nih.gov/pubmed/34051088
http://dx.doi.org/10.1093/jamia/ocab054
work_keys_str_mv AT kimjihoon privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT neumannlarissa privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT paulpaulina privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT daymichelee privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT aratowmichael privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT belldouglass privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT doctorjasonn privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT hinskeludwigc privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT jiangxiaoqian privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT kimkatherinek privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT mathenymichaele privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT meekerdaniella privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT pletchermarkj privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT schillinglisam privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT soohoospencer privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT xuhua privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT zhengkai privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT ohnomachadolucila privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations
AT privacyprotectingreliableresponsedatadiscoveryusingcovid19patientobservations