Cargando…

Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions

OBJECTIVE: We aimed to develop a distributed, immutable, and highly available cross-cloud blockchain system to facilitate federated data analysis activities among multiple institutions. MATERIALS AND METHODS: We preprocessed 9166 COVID-19 Structured Query Language (SQL) code, summary statistics, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuo, Tsung-Ting, Pham, Anh, Edelson, Maxim E, Kim, Jihoon, Chan, Jason, Gupta, Yash, Ohno-Machado, Lucila
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198529/
https://www.ncbi.nlm.nih.gov/pubmed/36916740
http://dx.doi.org/10.1093/jamia/ocad049
_version_ 1785044754375901184
author Kuo, Tsung-Ting
Pham, Anh
Edelson, Maxim E
Kim, Jihoon
Chan, Jason
Gupta, Yash
Ohno-Machado, Lucila
author_facet Kuo, Tsung-Ting
Pham, Anh
Edelson, Maxim E
Kim, Jihoon
Chan, Jason
Gupta, Yash
Ohno-Machado, Lucila
author_sort Kuo, Tsung-Ting
collection PubMed
description OBJECTIVE: We aimed to develop a distributed, immutable, and highly available cross-cloud blockchain system to facilitate federated data analysis activities among multiple institutions. MATERIALS AND METHODS: We preprocessed 9166 COVID-19 Structured Query Language (SQL) code, summary statistics, and user activity logs, from the GitHub repository of the Reliable Response Data Discovery for COVID-19 (R2D2) Consortium. The repository collected local summary statistics from participating institutions and aggregated the global result to a COVID-19-related clinical query, previously posted by clinicians on a website. We developed both on-chain and off-chain components to store/query these activity logs and their associated queries/results on a blockchain for immutability, transparency, and high availability of research communication. We measured run-time efficiency of contract deployment, network transactions, and confirmed the accuracy of recorded logs compared to a centralized baseline solution. RESULTS: The smart contract deployment took 4.5 s on an average. The time to record an activity log on blockchain was slightly over 2 s, versus 5–9 s for baseline. For querying, each query took on an average less than 0.4 s on blockchain, versus around 2.1 s for baseline. DISCUSSION: The low deployment, recording, and querying times confirm the feasibility of our cross-cloud, blockchain-based federated data analysis system. We have yet to evaluate the system on a larger network with multiple nodes per cloud, to consider how to accommodate a surge in activities, and to investigate methods to lower querying time as the blockchain grows. CONCLUSION: Blockchain technology can be used to support federated data analysis among multiple institutions.
format Online
Article
Text
id pubmed-10198529
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101985292023-05-20 Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions Kuo, Tsung-Ting Pham, Anh Edelson, Maxim E Kim, Jihoon Chan, Jason Gupta, Yash Ohno-Machado, Lucila J Am Med Inform Assoc Research and Applications OBJECTIVE: We aimed to develop a distributed, immutable, and highly available cross-cloud blockchain system to facilitate federated data analysis activities among multiple institutions. MATERIALS AND METHODS: We preprocessed 9166 COVID-19 Structured Query Language (SQL) code, summary statistics, and user activity logs, from the GitHub repository of the Reliable Response Data Discovery for COVID-19 (R2D2) Consortium. The repository collected local summary statistics from participating institutions and aggregated the global result to a COVID-19-related clinical query, previously posted by clinicians on a website. We developed both on-chain and off-chain components to store/query these activity logs and their associated queries/results on a blockchain for immutability, transparency, and high availability of research communication. We measured run-time efficiency of contract deployment, network transactions, and confirmed the accuracy of recorded logs compared to a centralized baseline solution. RESULTS: The smart contract deployment took 4.5 s on an average. The time to record an activity log on blockchain was slightly over 2 s, versus 5–9 s for baseline. For querying, each query took on an average less than 0.4 s on blockchain, versus around 2.1 s for baseline. DISCUSSION: The low deployment, recording, and querying times confirm the feasibility of our cross-cloud, blockchain-based federated data analysis system. We have yet to evaluate the system on a larger network with multiple nodes per cloud, to consider how to accommodate a surge in activities, and to investigate methods to lower querying time as the blockchain grows. CONCLUSION: Blockchain technology can be used to support federated data analysis among multiple institutions. Oxford University Press 2023-03-14 /pmc/articles/PMC10198529/ /pubmed/36916740 http://dx.doi.org/10.1093/jamia/ocad049 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Kuo, Tsung-Ting
Pham, Anh
Edelson, Maxim E
Kim, Jihoon
Chan, Jason
Gupta, Yash
Ohno-Machado, Lucila
Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
title Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
title_full Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
title_fullStr Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
title_full_unstemmed Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
title_short Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
title_sort blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated covid-19 data analysis from multiple institutions
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198529/
https://www.ncbi.nlm.nih.gov/pubmed/36916740
http://dx.doi.org/10.1093/jamia/ocad049
work_keys_str_mv AT kuotsungting blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT phamanh blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT edelsonmaxime blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT kimjihoon blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT chanjason blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT guptayash blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT ohnomachadolucila blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions
AT blockchainenabledimmutabledistributedandhighlyavailableclinicalresearchactivityloggingsystemforfederatedcovid19dataanalysisfrommultipleinstitutions