Cargando…

Cloud-Based Federated Learning Implementation Across Medical Centers

Building well-performing machine learning (ML) models in health care has always been exigent because of the data-sharing concerns, yet ML approaches often require larger training samples than is afforded by one institution. This paper explores several federated learning implementations by applying t...

Descripción completa

Detalles Bibliográficos
Autores principales: Rajendran, Suraj, Obeid, Jihad S., Binol, Hamidullah, D`Agostino, Ralph, Foley, Kristie, Zhang, Wei, Austin, Philip, Brakefield, Joey, Gurcan, Metin N., Topaloglu, Umit
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Clinical Oncology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8140794/
https://www.ncbi.nlm.nih.gov/pubmed/33411624
http://dx.doi.org/10.1200/CCI.20.00060
_version_ 1783696246206103552
author Rajendran, Suraj
Obeid, Jihad S.
Binol, Hamidullah
D`Agostino, Ralph
Foley, Kristie
Zhang, Wei
Austin, Philip
Brakefield, Joey
Gurcan, Metin N.
Topaloglu, Umit
author_facet Rajendran, Suraj
Obeid, Jihad S.
Binol, Hamidullah
D`Agostino, Ralph
Foley, Kristie
Zhang, Wei
Austin, Philip
Brakefield, Joey
Gurcan, Metin N.
Topaloglu, Umit
author_sort Rajendran, Suraj
collection PubMed
description Building well-performing machine learning (ML) models in health care has always been exigent because of the data-sharing concerns, yet ML approaches often require larger training samples than is afforded by one institution. This paper explores several federated learning implementations by applying them in both a simulated environment and an actual implementation using electronic health record data from two academic medical centers on a Microsoft Azure Cloud Databricks platform. MATERIALS AND METHODS: Using two separate cloud tenants, ML models were created, trained, and exchanged from one institution to another via a GitHub repository. Federated learning processes were applied to both artificial neural networks (ANNs) and logistic regression (LR) models on the horizontal data sets that are varying in count and availability. Incremental and cyclic federated learning models have been tested in simulation and real environments. RESULTS: The cyclically trained ANN showed a 3% increase in performance, a significant improvement across most attempts (P < .05). Single weight neural network models showed improvement in some cases. However, LR models did not show much improvement after federated learning processes. The specific process that improved the performance differed based on the ML model and how federated learning was implemented. Moreover, we have confirmed that the order of the institutions during the training did influence the overall performance increase. CONCLUSION: Unlike previous studies, our work has shown the implementation and effectiveness of federated learning processes beyond simulation. Additionally, we have identified different federated learning models that have achieved statistically significant performances. More work is needed to achieve effective federated learning processes in biomedicine, while preserving the security and privacy of the data.
format Online
Article
Text
id pubmed-8140794
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society of Clinical Oncology
record_format MEDLINE/PubMed
spelling pubmed-81407942022-01-07 Cloud-Based Federated Learning Implementation Across Medical Centers Rajendran, Suraj Obeid, Jihad S. Binol, Hamidullah D`Agostino, Ralph Foley, Kristie Zhang, Wei Austin, Philip Brakefield, Joey Gurcan, Metin N. Topaloglu, Umit JCO Clin Cancer Inform ORIGINAL REPORTS Building well-performing machine learning (ML) models in health care has always been exigent because of the data-sharing concerns, yet ML approaches often require larger training samples than is afforded by one institution. This paper explores several federated learning implementations by applying them in both a simulated environment and an actual implementation using electronic health record data from two academic medical centers on a Microsoft Azure Cloud Databricks platform. MATERIALS AND METHODS: Using two separate cloud tenants, ML models were created, trained, and exchanged from one institution to another via a GitHub repository. Federated learning processes were applied to both artificial neural networks (ANNs) and logistic regression (LR) models on the horizontal data sets that are varying in count and availability. Incremental and cyclic federated learning models have been tested in simulation and real environments. RESULTS: The cyclically trained ANN showed a 3% increase in performance, a significant improvement across most attempts (P < .05). Single weight neural network models showed improvement in some cases. However, LR models did not show much improvement after federated learning processes. The specific process that improved the performance differed based on the ML model and how federated learning was implemented. Moreover, we have confirmed that the order of the institutions during the training did influence the overall performance increase. CONCLUSION: Unlike previous studies, our work has shown the implementation and effectiveness of federated learning processes beyond simulation. Additionally, we have identified different federated learning models that have achieved statistically significant performances. More work is needed to achieve effective federated learning processes in biomedicine, while preserving the security and privacy of the data. American Society of Clinical Oncology 2021-01-07 /pmc/articles/PMC8140794/ /pubmed/33411624 http://dx.doi.org/10.1200/CCI.20.00060 Text en © 2021 by American Society of Clinical Oncology https://creativecommons.org/licenses/by-nc-nd/4.0/Creative Commons Attribution Non-Commercial No Derivatives 4.0 License: https://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle ORIGINAL REPORTS
Rajendran, Suraj
Obeid, Jihad S.
Binol, Hamidullah
D`Agostino, Ralph
Foley, Kristie
Zhang, Wei
Austin, Philip
Brakefield, Joey
Gurcan, Metin N.
Topaloglu, Umit
Cloud-Based Federated Learning Implementation Across Medical Centers
title Cloud-Based Federated Learning Implementation Across Medical Centers
title_full Cloud-Based Federated Learning Implementation Across Medical Centers
title_fullStr Cloud-Based Federated Learning Implementation Across Medical Centers
title_full_unstemmed Cloud-Based Federated Learning Implementation Across Medical Centers
title_short Cloud-Based Federated Learning Implementation Across Medical Centers
title_sort cloud-based federated learning implementation across medical centers
topic ORIGINAL REPORTS
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8140794/
https://www.ncbi.nlm.nih.gov/pubmed/33411624
http://dx.doi.org/10.1200/CCI.20.00060
work_keys_str_mv AT rajendransuraj cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT obeidjihads cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT binolhamidullah cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT dagostinoralph cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT foleykristie cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT zhangwei cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT austinphilip cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT brakefieldjoey cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT gurcanmetinn cloudbasedfederatedlearningimplementationacrossmedicalcenters
AT topalogluumit cloudbasedfederatedlearningimplementationacrossmedicalcenters