Cargando…

Privacy-aware multi-institutional time-to-event studies

Clinical time-to-event studies are dependent on large sample sizes, often not available at a single institution. However, this is countered by the fact that, particularly in the medical field, individual institutions are often legally unable to share their data, as medical data is subject to strong...

Descripción completa

Detalles Bibliográficos
Autores principales: Späth, Julian, Matschinske, Julian, Kamanu, Frederick K., Murphy, Sabina A., Zolotareva, Olga, Bakhtiari, Mohammad, Antman, Elliott M., Loscalzo, Joseph, Brauneck, Alissa, Schmalhorst, Louisa, Buchholtz, Gabriele, Baumbach, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931301/
https://www.ncbi.nlm.nih.gov/pubmed/36812603
http://dx.doi.org/10.1371/journal.pdig.0000101
_version_ 1784889219247767552
author Späth, Julian
Matschinske, Julian
Kamanu, Frederick K.
Murphy, Sabina A.
Zolotareva, Olga
Bakhtiari, Mohammad
Antman, Elliott M.
Loscalzo, Joseph
Brauneck, Alissa
Schmalhorst, Louisa
Buchholtz, Gabriele
Baumbach, Jan
author_facet Späth, Julian
Matschinske, Julian
Kamanu, Frederick K.
Murphy, Sabina A.
Zolotareva, Olga
Bakhtiari, Mohammad
Antman, Elliott M.
Loscalzo, Joseph
Brauneck, Alissa
Schmalhorst, Louisa
Buchholtz, Gabriele
Baumbach, Jan
author_sort Späth, Julian
collection PubMed
description Clinical time-to-event studies are dependent on large sample sizes, often not available at a single institution. However, this is countered by the fact that, particularly in the medical field, individual institutions are often legally unable to share their data, as medical data is subject to strong privacy protection due to its particular sensitivity. But the collection, and especially aggregation into centralized datasets, is also fraught with substantial legal risks and often outright unlawful. Existing solutions using federated learning have already demonstrated considerable potential as an alternative for central data collection. Unfortunately, current approaches are incomplete or not easily applicable in clinical studies owing to the complexity of federated infrastructures. This work presents privacy-aware and federated implementations of the most used time-to-event algorithms (survival curve, cumulative hazard rate, log-rank test, and Cox proportional hazards model) in clinical trials, based on a hybrid approach of federated learning, additive secret sharing, and differential privacy. On several benchmark datasets, we show that all algorithms produce highly similar, or in some cases, even identical results compared to traditional centralized time-to-event algorithms. Furthermore, we were able to reproduce the results of a previous clinical time-to-event study in various federated scenarios. All algorithms are accessible through the intuitive web-app Partea (https://partea.zbh.uni-hamburg.de), offering a graphical user interface for clinicians and non-computational researchers without programming knowledge. Partea removes the high infrastructural hurdles derived from existing federated learning approaches and removes the complexity of execution. Therefore, it is an easy-to-use alternative to central data collection, reducing bureaucratic efforts but also the legal risks associated with the processing of personal data to a minimum.
format Online
Article
Text
id pubmed-9931301
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99313012023-02-16 Privacy-aware multi-institutional time-to-event studies Späth, Julian Matschinske, Julian Kamanu, Frederick K. Murphy, Sabina A. Zolotareva, Olga Bakhtiari, Mohammad Antman, Elliott M. Loscalzo, Joseph Brauneck, Alissa Schmalhorst, Louisa Buchholtz, Gabriele Baumbach, Jan PLOS Digit Health Research Article Clinical time-to-event studies are dependent on large sample sizes, often not available at a single institution. However, this is countered by the fact that, particularly in the medical field, individual institutions are often legally unable to share their data, as medical data is subject to strong privacy protection due to its particular sensitivity. But the collection, and especially aggregation into centralized datasets, is also fraught with substantial legal risks and often outright unlawful. Existing solutions using federated learning have already demonstrated considerable potential as an alternative for central data collection. Unfortunately, current approaches are incomplete or not easily applicable in clinical studies owing to the complexity of federated infrastructures. This work presents privacy-aware and federated implementations of the most used time-to-event algorithms (survival curve, cumulative hazard rate, log-rank test, and Cox proportional hazards model) in clinical trials, based on a hybrid approach of federated learning, additive secret sharing, and differential privacy. On several benchmark datasets, we show that all algorithms produce highly similar, or in some cases, even identical results compared to traditional centralized time-to-event algorithms. Furthermore, we were able to reproduce the results of a previous clinical time-to-event study in various federated scenarios. All algorithms are accessible through the intuitive web-app Partea (https://partea.zbh.uni-hamburg.de), offering a graphical user interface for clinicians and non-computational researchers without programming knowledge. Partea removes the high infrastructural hurdles derived from existing federated learning approaches and removes the complexity of execution. Therefore, it is an easy-to-use alternative to central data collection, reducing bureaucratic efforts but also the legal risks associated with the processing of personal data to a minimum. Public Library of Science 2022-09-06 /pmc/articles/PMC9931301/ /pubmed/36812603 http://dx.doi.org/10.1371/journal.pdig.0000101 Text en © 2022 Späth et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Späth, Julian
Matschinske, Julian
Kamanu, Frederick K.
Murphy, Sabina A.
Zolotareva, Olga
Bakhtiari, Mohammad
Antman, Elliott M.
Loscalzo, Joseph
Brauneck, Alissa
Schmalhorst, Louisa
Buchholtz, Gabriele
Baumbach, Jan
Privacy-aware multi-institutional time-to-event studies
title Privacy-aware multi-institutional time-to-event studies
title_full Privacy-aware multi-institutional time-to-event studies
title_fullStr Privacy-aware multi-institutional time-to-event studies
title_full_unstemmed Privacy-aware multi-institutional time-to-event studies
title_short Privacy-aware multi-institutional time-to-event studies
title_sort privacy-aware multi-institutional time-to-event studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931301/
https://www.ncbi.nlm.nih.gov/pubmed/36812603
http://dx.doi.org/10.1371/journal.pdig.0000101
work_keys_str_mv AT spathjulian privacyawaremultiinstitutionaltimetoeventstudies
AT matschinskejulian privacyawaremultiinstitutionaltimetoeventstudies
AT kamanufrederickk privacyawaremultiinstitutionaltimetoeventstudies
AT murphysabinaa privacyawaremultiinstitutionaltimetoeventstudies
AT zolotarevaolga privacyawaremultiinstitutionaltimetoeventstudies
AT bakhtiarimohammad privacyawaremultiinstitutionaltimetoeventstudies
AT antmanelliottm privacyawaremultiinstitutionaltimetoeventstudies
AT loscalzojoseph privacyawaremultiinstitutionaltimetoeventstudies
AT brauneckalissa privacyawaremultiinstitutionaltimetoeventstudies
AT schmalhorstlouisa privacyawaremultiinstitutionaltimetoeventstudies
AT buchholtzgabriele privacyawaremultiinstitutionaltimetoeventstudies
AT baumbachjan privacyawaremultiinstitutionaltimetoeventstudies