Cargando…

Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data

Several studies underscore the potential of deep learning in identifying complex patterns, leading to diagnostic and prognostic biomarkers. Identifying sufficiently large and diverse datasets, required for training, is a significant challenge in medicine and can rarely be found in individual institu...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheller, Micah J., Edwards, Brandon, Reina, G. Anthony, Martin, Jason, Pati, Sarthak, Kotrotsou, Aikaterini, Milchenko, Mikhail, Xu, Weilin, Marcus, Daniel, Colen, Rivka R., Bakas, Spyridon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387485/
https://www.ncbi.nlm.nih.gov/pubmed/32724046
http://dx.doi.org/10.1038/s41598-020-69250-1
_version_ 1783564130358132736
author Sheller, Micah J.
Edwards, Brandon
Reina, G. Anthony
Martin, Jason
Pati, Sarthak
Kotrotsou, Aikaterini
Milchenko, Mikhail
Xu, Weilin
Marcus, Daniel
Colen, Rivka R.
Bakas, Spyridon
author_facet Sheller, Micah J.
Edwards, Brandon
Reina, G. Anthony
Martin, Jason
Pati, Sarthak
Kotrotsou, Aikaterini
Milchenko, Mikhail
Xu, Weilin
Marcus, Daniel
Colen, Rivka R.
Bakas, Spyridon
author_sort Sheller, Micah J.
collection PubMed
description Several studies underscore the potential of deep learning in identifying complex patterns, leading to diagnostic and prognostic biomarkers. Identifying sufficiently large and diverse datasets, required for training, is a significant challenge in medicine and can rarely be found in individual institutions. Multi-institutional collaborations based on centrally-shared patient data face privacy and ownership challenges. Federated learning is a novel paradigm for data-private multi-institutional collaborations, where model-learning leverages all available data without sharing data between institutions, by distributing the model-training to the data-owners and aggregating their results. We show that federated learning among 10 institutions results in models reaching 99% of the model quality achieved with centralized data, and evaluate generalizability on data from institutions outside the federation. We further investigate the effects of data distribution across collaborating institutions on model quality and learning patterns, indicating that increased access to data through data private multi-institutional collaborations can benefit model quality more than the errors introduced by the collaborative method. Finally, we compare with other collaborative-learning approaches demonstrating the superiority of federated learning, and discuss practical implementation considerations. Clinical adoption of federated learning is expected to lead to models trained on datasets of unprecedented size, hence have a catalytic impact towards precision/personalized medicine.
format Online
Article
Text
id pubmed-7387485
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-73874852020-07-29 Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data Sheller, Micah J. Edwards, Brandon Reina, G. Anthony Martin, Jason Pati, Sarthak Kotrotsou, Aikaterini Milchenko, Mikhail Xu, Weilin Marcus, Daniel Colen, Rivka R. Bakas, Spyridon Sci Rep Article Several studies underscore the potential of deep learning in identifying complex patterns, leading to diagnostic and prognostic biomarkers. Identifying sufficiently large and diverse datasets, required for training, is a significant challenge in medicine and can rarely be found in individual institutions. Multi-institutional collaborations based on centrally-shared patient data face privacy and ownership challenges. Federated learning is a novel paradigm for data-private multi-institutional collaborations, where model-learning leverages all available data without sharing data between institutions, by distributing the model-training to the data-owners and aggregating their results. We show that federated learning among 10 institutions results in models reaching 99% of the model quality achieved with centralized data, and evaluate generalizability on data from institutions outside the federation. We further investigate the effects of data distribution across collaborating institutions on model quality and learning patterns, indicating that increased access to data through data private multi-institutional collaborations can benefit model quality more than the errors introduced by the collaborative method. Finally, we compare with other collaborative-learning approaches demonstrating the superiority of federated learning, and discuss practical implementation considerations. Clinical adoption of federated learning is expected to lead to models trained on datasets of unprecedented size, hence have a catalytic impact towards precision/personalized medicine. Nature Publishing Group UK 2020-07-28 /pmc/articles/PMC7387485/ /pubmed/32724046 http://dx.doi.org/10.1038/s41598-020-69250-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sheller, Micah J.
Edwards, Brandon
Reina, G. Anthony
Martin, Jason
Pati, Sarthak
Kotrotsou, Aikaterini
Milchenko, Mikhail
Xu, Weilin
Marcus, Daniel
Colen, Rivka R.
Bakas, Spyridon
Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
title Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
title_full Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
title_fullStr Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
title_full_unstemmed Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
title_short Federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
title_sort federated learning in medicine: facilitating multi-institutional collaborations without sharing patient data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7387485/
https://www.ncbi.nlm.nih.gov/pubmed/32724046
http://dx.doi.org/10.1038/s41598-020-69250-1
work_keys_str_mv AT shellermicahj federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT edwardsbrandon federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT reinaganthony federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT martinjason federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT patisarthak federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT kotrotsouaikaterini federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT milchenkomikhail federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT xuweilin federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT marcusdaniel federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT colenrivkar federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata
AT bakasspyridon federatedlearninginmedicinefacilitatingmultiinstitutionalcollaborationswithoutsharingpatientdata