Cargando…

Federated learning for preserving data privacy in collaborative healthcare research

Generalizability, external validity, and reproducibility are high priorities for artificial intelligence applications in healthcare. Traditional approaches to addressing these elements involve sharing patient data between institutions or practice settings, which can compromise data privacy (individu...

Descripción completa

Detalles Bibliográficos
Autores principales: Loftus, Tyler J, Ruppert, Matthew M, Shickel, Benjamin, Ozrazgat-Baslanti, Tezcan, Balch, Jeremy A, Efron, Philip A, Upchurch, Gilbert R, Rashidi, Parisa, Tignanelli, Christopher, Bian, Jiang, Bihorac, Azra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619858/
https://www.ncbi.nlm.nih.gov/pubmed/36325438
http://dx.doi.org/10.1177/20552076221134455
_version_ 1784821289890873344
author Loftus, Tyler J
Ruppert, Matthew M
Shickel, Benjamin
Ozrazgat-Baslanti, Tezcan
Balch, Jeremy A
Efron, Philip A
Upchurch, Gilbert R
Rashidi, Parisa
Tignanelli, Christopher
Bian, Jiang
Bihorac, Azra
author_facet Loftus, Tyler J
Ruppert, Matthew M
Shickel, Benjamin
Ozrazgat-Baslanti, Tezcan
Balch, Jeremy A
Efron, Philip A
Upchurch, Gilbert R
Rashidi, Parisa
Tignanelli, Christopher
Bian, Jiang
Bihorac, Azra
author_sort Loftus, Tyler J
collection PubMed
description Generalizability, external validity, and reproducibility are high priorities for artificial intelligence applications in healthcare. Traditional approaches to addressing these elements involve sharing patient data between institutions or practice settings, which can compromise data privacy (individuals’ right to prevent the sharing and disclosure of information about themselves) and data security (simultaneously preserving confidentiality, accuracy, fidelity, and availability of data). This article describes insights from real-world implementation of federated learning techniques that offer opportunities to maintain both data privacy and availability via collaborative machine learning that shares knowledge, not data. Local models are trained separately on local data. As they train, they send local model updates (e.g. coefficients or gradients) for consolidation into a global model. In some use cases, global models outperform local models on new, previously unseen local datasets, suggesting that collaborative learning from a greater number of examples, including a greater number of rare cases, may improve predictive performance. Even when sharing model updates rather than data, privacy leakage can occur when adversaries perform property or membership inference attacks which can be used to ascertain information about the training set. Emerging techniques mitigate risk from adversarial attacks, allowing investigators to maintain both data privacy and availability in collaborative healthcare research. When data heterogeneity between participating centers is high, personalized algorithms may offer greater generalizability by improving performance on data from centers with proportionately smaller training sample sizes. Properly applied, federated learning has the potential to optimize the reproducibility and performance of collaborative learning while preserving data security and privacy.
format Online
Article
Text
id pubmed-9619858
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-96198582022-11-01 Federated learning for preserving data privacy in collaborative healthcare research Loftus, Tyler J Ruppert, Matthew M Shickel, Benjamin Ozrazgat-Baslanti, Tezcan Balch, Jeremy A Efron, Philip A Upchurch, Gilbert R Rashidi, Parisa Tignanelli, Christopher Bian, Jiang Bihorac, Azra Digit Health Best Practice Generalizability, external validity, and reproducibility are high priorities for artificial intelligence applications in healthcare. Traditional approaches to addressing these elements involve sharing patient data between institutions or practice settings, which can compromise data privacy (individuals’ right to prevent the sharing and disclosure of information about themselves) and data security (simultaneously preserving confidentiality, accuracy, fidelity, and availability of data). This article describes insights from real-world implementation of federated learning techniques that offer opportunities to maintain both data privacy and availability via collaborative machine learning that shares knowledge, not data. Local models are trained separately on local data. As they train, they send local model updates (e.g. coefficients or gradients) for consolidation into a global model. In some use cases, global models outperform local models on new, previously unseen local datasets, suggesting that collaborative learning from a greater number of examples, including a greater number of rare cases, may improve predictive performance. Even when sharing model updates rather than data, privacy leakage can occur when adversaries perform property or membership inference attacks which can be used to ascertain information about the training set. Emerging techniques mitigate risk from adversarial attacks, allowing investigators to maintain both data privacy and availability in collaborative healthcare research. When data heterogeneity between participating centers is high, personalized algorithms may offer greater generalizability by improving performance on data from centers with proportionately smaller training sample sizes. Properly applied, federated learning has the potential to optimize the reproducibility and performance of collaborative learning while preserving data security and privacy. SAGE Publications 2022-10-27 /pmc/articles/PMC9619858/ /pubmed/36325438 http://dx.doi.org/10.1177/20552076221134455 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Best Practice
Loftus, Tyler J
Ruppert, Matthew M
Shickel, Benjamin
Ozrazgat-Baslanti, Tezcan
Balch, Jeremy A
Efron, Philip A
Upchurch, Gilbert R
Rashidi, Parisa
Tignanelli, Christopher
Bian, Jiang
Bihorac, Azra
Federated learning for preserving data privacy in collaborative healthcare research
title Federated learning for preserving data privacy in collaborative healthcare research
title_full Federated learning for preserving data privacy in collaborative healthcare research
title_fullStr Federated learning for preserving data privacy in collaborative healthcare research
title_full_unstemmed Federated learning for preserving data privacy in collaborative healthcare research
title_short Federated learning for preserving data privacy in collaborative healthcare research
title_sort federated learning for preserving data privacy in collaborative healthcare research
topic Best Practice
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619858/
https://www.ncbi.nlm.nih.gov/pubmed/36325438
http://dx.doi.org/10.1177/20552076221134455
work_keys_str_mv AT loftustylerj federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT ruppertmatthewm federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT shickelbenjamin federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT ozrazgatbaslantitezcan federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT balchjeremya federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT efronphilipa federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT upchurchgilbertr federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT rashidiparisa federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT tignanellichristopher federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT bianjiang federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch
AT bihoracazra federatedlearningforpreservingdataprivacyincollaborativehealthcareresearch