Cargando…

Federated learning and differential privacy for medical image analysis

The artificial intelligence revolution has been spurred forward by the availability of large-scale datasets. In contrast, the paucity of large-scale medical datasets hinders the application of machine learning in healthcare. The lack of publicly available multi-centric and diverse datasets mainly st...

Descripción completa

Detalles Bibliográficos
Autores principales: Adnan, Mohammed, Kalra, Shivam, Cresswell, Jesse C., Taylor, Graham W., Tizhoosh, Hamid R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8816913/
https://www.ncbi.nlm.nih.gov/pubmed/35121774
http://dx.doi.org/10.1038/s41598-022-05539-7
_version_ 1784645526932684800
author Adnan, Mohammed
Kalra, Shivam
Cresswell, Jesse C.
Taylor, Graham W.
Tizhoosh, Hamid R.
author_facet Adnan, Mohammed
Kalra, Shivam
Cresswell, Jesse C.
Taylor, Graham W.
Tizhoosh, Hamid R.
author_sort Adnan, Mohammed
collection PubMed
description The artificial intelligence revolution has been spurred forward by the availability of large-scale datasets. In contrast, the paucity of large-scale medical datasets hinders the application of machine learning in healthcare. The lack of publicly available multi-centric and diverse datasets mainly stems from confidentiality and privacy concerns around sharing medical data. To demonstrate a feasible path forward in medical image imaging, we conduct a case study of applying a differentially private federated learning framework for analysis of histopathology images, the largest and perhaps most complex medical images. We study the effects of IID and non-IID distributions along with the number of healthcare providers, i.e., hospitals and clinics, and the individual dataset sizes, using The Cancer Genome Atlas (TCGA) dataset, a public repository, to simulate a distributed environment. We empirically compare the performance of private, distributed training to conventional training and demonstrate that distributed training can achieve similar performance with strong privacy guarantees. We also study the effect of different source domains for histopathology images by evaluating the performance using external validation. Our work indicates that differentially private federated learning is a viable and reliable framework for the collaborative development of machine learning models in medical image analysis.
format Online
Article
Text
id pubmed-8816913
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-88169132022-02-07 Federated learning and differential privacy for medical image analysis Adnan, Mohammed Kalra, Shivam Cresswell, Jesse C. Taylor, Graham W. Tizhoosh, Hamid R. Sci Rep Article The artificial intelligence revolution has been spurred forward by the availability of large-scale datasets. In contrast, the paucity of large-scale medical datasets hinders the application of machine learning in healthcare. The lack of publicly available multi-centric and diverse datasets mainly stems from confidentiality and privacy concerns around sharing medical data. To demonstrate a feasible path forward in medical image imaging, we conduct a case study of applying a differentially private federated learning framework for analysis of histopathology images, the largest and perhaps most complex medical images. We study the effects of IID and non-IID distributions along with the number of healthcare providers, i.e., hospitals and clinics, and the individual dataset sizes, using The Cancer Genome Atlas (TCGA) dataset, a public repository, to simulate a distributed environment. We empirically compare the performance of private, distributed training to conventional training and demonstrate that distributed training can achieve similar performance with strong privacy guarantees. We also study the effect of different source domains for histopathology images by evaluating the performance using external validation. Our work indicates that differentially private federated learning is a viable and reliable framework for the collaborative development of machine learning models in medical image analysis. Nature Publishing Group UK 2022-02-04 /pmc/articles/PMC8816913/ /pubmed/35121774 http://dx.doi.org/10.1038/s41598-022-05539-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Adnan, Mohammed
Kalra, Shivam
Cresswell, Jesse C.
Taylor, Graham W.
Tizhoosh, Hamid R.
Federated learning and differential privacy for medical image analysis
title Federated learning and differential privacy for medical image analysis
title_full Federated learning and differential privacy for medical image analysis
title_fullStr Federated learning and differential privacy for medical image analysis
title_full_unstemmed Federated learning and differential privacy for medical image analysis
title_short Federated learning and differential privacy for medical image analysis
title_sort federated learning and differential privacy for medical image analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8816913/
https://www.ncbi.nlm.nih.gov/pubmed/35121774
http://dx.doi.org/10.1038/s41598-022-05539-7
work_keys_str_mv AT adnanmohammed federatedlearninganddifferentialprivacyformedicalimageanalysis
AT kalrashivam federatedlearninganddifferentialprivacyformedicalimageanalysis
AT cresswelljessec federatedlearninganddifferentialprivacyformedicalimageanalysis
AT taylorgrahamw federatedlearninganddifferentialprivacyformedicalimageanalysis
AT tizhooshhamidr federatedlearninganddifferentialprivacyformedicalimageanalysis