Cargando…
Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study
BACKGROUND: In clinical research, important variables may be collected from multiple data sources. Physical pooling of patient-level data from multiple sources often raises several challenges, including proper protection of patient privacy and proprietary interests. We previously developed an SAS-ba...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105763/ https://www.ncbi.nlm.nih.gov/pubmed/33890866 http://dx.doi.org/10.2196/21459 |
_version_ | 1783689665352564736 |
---|---|
author | Her, Qoua Kent, Thomas Samizo, Yuji Slavkovic, Aleksandra Vilk, Yury Toh, Sengwee |
author_facet | Her, Qoua Kent, Thomas Samizo, Yuji Slavkovic, Aleksandra Vilk, Yury Toh, Sengwee |
author_sort | Her, Qoua |
collection | PubMed |
description | BACKGROUND: In clinical research, important variables may be collected from multiple data sources. Physical pooling of patient-level data from multiple sources often raises several challenges, including proper protection of patient privacy and proprietary interests. We previously developed an SAS-based package to perform distributed regression—a suite of privacy-protecting methods that perform multivariable-adjusted regression analysis using only summary-level information—with horizontally partitioned data, a setting where distinct cohorts of patients are available from different data sources. We integrated the package with PopMedNet, an open-source file transfer software, to facilitate secure file transfer between the analysis center and the data-contributing sites. The feasibility of using PopMedNet to facilitate distributed regression analysis (DRA) with vertically partitioned data, a setting where the data attributes from a cohort of patients are available from different data sources, was unknown. OBJECTIVE: The objective of the study was to describe the feasibility of using PopMedNet and enhancements to PopMedNet to facilitate automatable vertical DRA (vDRA) in real-world settings. METHODS: We gathered the statistical and informatic requirements of using PopMedNet to facilitate automatable vDRA. We enhanced PopMedNet based on these requirements to improve its technical capability to support vDRA. RESULTS: PopMedNet can enable automatable vDRA. We identified and implemented two enhancements to PopMedNet that improved its technical capability to perform automatable vDRA in real-world settings. The first was the ability to simultaneously upload and download multiple files, and the second was the ability to directly transfer summary-level information between the data-contributing sites without a third-party analysis center. CONCLUSIONS: PopMedNet can be used to facilitate automatable vDRA to protect patient privacy and support clinical research in real-world settings. |
format | Online Article Text |
id | pubmed-8105763 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-81057632021-05-12 Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study Her, Qoua Kent, Thomas Samizo, Yuji Slavkovic, Aleksandra Vilk, Yury Toh, Sengwee JMIR Med Inform Original Paper BACKGROUND: In clinical research, important variables may be collected from multiple data sources. Physical pooling of patient-level data from multiple sources often raises several challenges, including proper protection of patient privacy and proprietary interests. We previously developed an SAS-based package to perform distributed regression—a suite of privacy-protecting methods that perform multivariable-adjusted regression analysis using only summary-level information—with horizontally partitioned data, a setting where distinct cohorts of patients are available from different data sources. We integrated the package with PopMedNet, an open-source file transfer software, to facilitate secure file transfer between the analysis center and the data-contributing sites. The feasibility of using PopMedNet to facilitate distributed regression analysis (DRA) with vertically partitioned data, a setting where the data attributes from a cohort of patients are available from different data sources, was unknown. OBJECTIVE: The objective of the study was to describe the feasibility of using PopMedNet and enhancements to PopMedNet to facilitate automatable vertical DRA (vDRA) in real-world settings. METHODS: We gathered the statistical and informatic requirements of using PopMedNet to facilitate automatable vDRA. We enhanced PopMedNet based on these requirements to improve its technical capability to support vDRA. RESULTS: PopMedNet can enable automatable vDRA. We identified and implemented two enhancements to PopMedNet that improved its technical capability to perform automatable vDRA in real-world settings. The first was the ability to simultaneously upload and download multiple files, and the second was the ability to directly transfer summary-level information between the data-contributing sites without a third-party analysis center. CONCLUSIONS: PopMedNet can be used to facilitate automatable vDRA to protect patient privacy and support clinical research in real-world settings. JMIR Publications 2021-04-23 /pmc/articles/PMC8105763/ /pubmed/33890866 http://dx.doi.org/10.2196/21459 Text en ©Qoua Her, Thomas Kent, Yuji Samizo, Aleksandra Slavkovic, Yury Vilk, Sengwee Toh. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 23.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Her, Qoua Kent, Thomas Samizo, Yuji Slavkovic, Aleksandra Vilk, Yury Toh, Sengwee Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study |
title | Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study |
title_full | Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study |
title_fullStr | Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study |
title_full_unstemmed | Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study |
title_short | Automatable Distributed Regression Analysis of Vertically Partitioned Data Facilitated by PopMedNet: Feasibility and Enhancement Study |
title_sort | automatable distributed regression analysis of vertically partitioned data facilitated by popmednet: feasibility and enhancement study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105763/ https://www.ncbi.nlm.nih.gov/pubmed/33890866 http://dx.doi.org/10.2196/21459 |
work_keys_str_mv | AT herqoua automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy AT kentthomas automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy AT samizoyuji automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy AT slavkovicaleksandra automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy AT vilkyury automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy AT tohsengwee automatabledistributedregressionanalysisofverticallypartitioneddatafacilitatedbypopmednetfeasibilityandenhancementstudy |