Cargando…

Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study

BACKGROUND: In clinical research, important variables may be collected from multiple data sources. Physical pooling of patient-level data from multiple sources often raises several challenges, including proper protection of patient privacy and proprietary interests. We previously developed an SAS-ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Her, Qoua, Kent, Thomas, Samizo, Yuji, Slavkovic, Aleksandra, Vilk, Yury, Toh, Sengwee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105763/
https://www.ncbi.nlm.nih.gov/pubmed/33890866
http://dx.doi.org/10.2196/21459
_version_ 1783689665352564736
author Her, Qoua
Kent, Thomas
Samizo, Yuji
Slavkovic, Aleksandra
Vilk, Yury
Toh, Sengwee
author_facet Her, Qoua
Kent, Thomas
Samizo, Yuji
Slavkovic, Aleksandra
Vilk, Yury
Toh, Sengwee
author_sort Her, Qoua
collection PubMed
description BACKGROUND: In clinical research, important variables may be collected from multiple data sources. Physical pooling of patient-level data from multiple sources often raises several challenges, including proper protection of patient privacy and proprietary interests. We previously developed an SAS-based package to perform distributed regression—a suite of privacy-protecting methods that perform multivariable-adjusted regression analysis using only summary-level information—with horizontally partitioned data, a setting where distinct cohorts of patients are available from different data sources. We integrated the package with PopMedNet, an open-source file transfer software, to facilitate secure file transfer between the analysis center and the data-contributing sites. The feasibility of using PopMedNet to facilitate distributed regression analysis (DRA) with vertically partitioned data, a setting where the data attributes from a cohort of patients are available from different data sources, was unknown. OBJECTIVE: The objective of the study was to describe the feasibility of using PopMedNet and enhancements to PopMedNet to facilitate automatable vertical DRA (vDRA) in real-world settings. METHODS: We gathered the statistical and informatic requirements of using PopMedNet to facilitate automatable vDRA. We enhanced PopMedNet based on these requirements to improve its technical capability to support vDRA. RESULTS: PopMedNet can enable automatable vDRA. We identified and implemented two enhancements to PopMedNet that improved its technical capability to perform automatable vDRA in real-world settings. The first was the ability to simultaneously upload and download multiple files, and the second was the ability to directly transfer summary-level information between the data-contributing sites without a third-party analysis center. CONCLUSIONS: PopMedNet can be used to facilitate automatable vDRA to protect patient privacy and support clinical research in real-world settings.
format Online
Article
Text
id pubmed-8105763
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81057632021-05-12 Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study Her, Qoua Kent, Thomas Samizo, Yuji Slavkovic, Aleksandra Vilk, Yury Toh, Sengwee JMIR Med Inform Original Paper BACKGROUND: In clinical research, important variables may be collected from multiple data sources. Physical pooling of patient-level data from multiple sources often raises several challenges, including proper protection of patient privacy and proprietary interests. We previously developed an SAS-based package to perform distributed regression—a suite of privacy-protecting methods that perform multivariable-adjusted regression analysis using only summary-level information—with horizontally partitioned data, a setting where distinct cohorts of patients are available from different data sources. We integrated the package with PopMedNet, an open-source file transfer software, to facilitate secure file transfer between the analysis center and the data-contributing sites. The feasibility of using PopMedNet to facilitate distributed regression analysis (DRA) with vertically partitioned data, a setting where the data attributes from a cohort of patients are available from different data sources, was unknown. OBJECTIVE: The objective of the study was to describe the feasibility of using PopMedNet and enhancements to PopMedNet to facilitate automatable vertical DRA (vDRA) in real-world settings. METHODS: We gathered the statistical and informatic requirements of using PopMedNet to facilitate automatable vDRA. We enhanced PopMedNet based on these requirements to improve its technical capability to support vDRA. RESULTS: PopMedNet can enable automatable vDRA. We identified and implemented two enhancements to PopMedNet that improved its technical capability to perform automatable vDRA in real-world settings. The first was the ability to simultaneously upload and download multiple files, and the second was the ability to directly transfer summary-level information between the data-contributing sites without a third-party analysis center. CONCLUSIONS: PopMedNet can be used to facilitate automatable vDRA to protect patient privacy and support clinical research in real-world settings. JMIR Publications 2021-04-23 /pmc/articles/PMC8105763/ /pubmed/33890866 http://dx.doi.org/10.2196/21459 Text en ©Qoua Her, Thomas Kent, Yuji Samizo, Aleksandra Slavkovic, Yury Vilk, Sengwee Toh. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 23.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Her, Qoua
Kent, Thomas
Samizo, Yuji
Slavkovic, Aleksandra
Vilk, Yury
Toh, Sengwee
Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
title Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
title_full Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
title_fullStr Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
title_full_unstemmed Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
title_short Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
title_sort automatable distributed regression analysis of vertically partitioned data facilitated by popmednet: feasibility and enhancement study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105763/
https://www.ncbi.nlm.nih.gov/pubmed/33890866
http://dx.doi.org/10.2196/21459
work_keys_str_mv AT herqoua automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy
AT kentthomas automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy
AT samizoyuji automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy
AT slavkovicaleksandra automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy
AT vilkyury automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy
AT tohsengwee automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy