Cargando…

Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals

OBJECTIVE: Federated learning (FL) allows multiple distributed data holders to collaboratively learn a shared model without data sharing. However, individual health system data are heterogeneous. “Personalized” FL variations have been developed to counter data heterogeneity, but few have been evalua...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Le, Luo, Gaoxiang, Walker, Andrew, Zaiman, Zachary, Jones, Emma K, Gupta, Hemant, Kersten, Kristopher, Burns, John L, Harle, Christopher A, Magoc, Tanja, Shickel, Benjamin, Steenburg, Scott D, Loftus, Tyler, Melton, Genevieve B, Gichoya, Judy Wawira, Sun, Ju, Tignanelli, Christopher J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619688/
https://www.ncbi.nlm.nih.gov/pubmed/36214629
http://dx.doi.org/10.1093/jamia/ocac188
_version_ 1784821278208688128
author Peng, Le
Luo, Gaoxiang
Walker, Andrew
Zaiman, Zachary
Jones, Emma K
Gupta, Hemant
Kersten, Kristopher
Burns, John L
Harle, Christopher A
Magoc, Tanja
Shickel, Benjamin
Steenburg, Scott D
Loftus, Tyler
Melton, Genevieve B
Gichoya, Judy Wawira
Sun, Ju
Tignanelli, Christopher J
author_facet Peng, Le
Luo, Gaoxiang
Walker, Andrew
Zaiman, Zachary
Jones, Emma K
Gupta, Hemant
Kersten, Kristopher
Burns, John L
Harle, Christopher A
Magoc, Tanja
Shickel, Benjamin
Steenburg, Scott D
Loftus, Tyler
Melton, Genevieve B
Gichoya, Judy Wawira
Sun, Ju
Tignanelli, Christopher J
author_sort Peng, Le
collection PubMed
description OBJECTIVE: Federated learning (FL) allows multiple distributed data holders to collaboratively learn a shared model without data sharing. However, individual health system data are heterogeneous. “Personalized” FL variations have been developed to counter data heterogeneity, but few have been evaluated using real-world healthcare data. The purpose of this study is to investigate the performance of a single-site versus a 3-client federated model using a previously described Coronavirus Disease 19 (COVID-19) diagnostic model. Additionally, to investigate the effect of system heterogeneity, we evaluate the performance of 4 FL variations. MATERIALS AND METHODS: We leverage a FL healthcare collaborative including data from 5 international healthcare systems (US and Europe) encompassing 42 hospitals. We implemented a COVID-19 computer vision diagnosis system using the Federated Averaging (FedAvg) algorithm implemented on Clara Train SDK 4.0. To study the effect of data heterogeneity, training data was pooled from 3 systems locally and federation was simulated. We compared a centralized/pooled model, versus FedAvg, and 3 personalized FL variations (FedProx, FedBN, and FedAMP). RESULTS: We observed comparable model performance with respect to internal validation (local model: AUROC 0.94 vs FedAvg: 0.95, P = .5) and improved model generalizability with the FedAvg model (P < .05). When investigating the effects of model heterogeneity, we observed poor performance with FedAvg on internal validation as compared to personalized FL algorithms. FedAvg did have improved generalizability compared to personalized FL algorithms. On average, FedBN had the best rank performance on internal and external validation. CONCLUSION: FedAvg can significantly improve the generalization of the model compared to other personalization FL algorithms; however, at the cost of poor internal validity. Personalized FL may offer an opportunity to develop both internal and externally validated algorithms.
format Online
Article
Text
id pubmed-9619688
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96196882022-11-04 Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals Peng, Le Luo, Gaoxiang Walker, Andrew Zaiman, Zachary Jones, Emma K Gupta, Hemant Kersten, Kristopher Burns, John L Harle, Christopher A Magoc, Tanja Shickel, Benjamin Steenburg, Scott D Loftus, Tyler Melton, Genevieve B Gichoya, Judy Wawira Sun, Ju Tignanelli, Christopher J J Am Med Inform Assoc Research and Applications OBJECTIVE: Federated learning (FL) allows multiple distributed data holders to collaboratively learn a shared model without data sharing. However, individual health system data are heterogeneous. “Personalized” FL variations have been developed to counter data heterogeneity, but few have been evaluated using real-world healthcare data. The purpose of this study is to investigate the performance of a single-site versus a 3-client federated model using a previously described Coronavirus Disease 19 (COVID-19) diagnostic model. Additionally, to investigate the effect of system heterogeneity, we evaluate the performance of 4 FL variations. MATERIALS AND METHODS: We leverage a FL healthcare collaborative including data from 5 international healthcare systems (US and Europe) encompassing 42 hospitals. We implemented a COVID-19 computer vision diagnosis system using the Federated Averaging (FedAvg) algorithm implemented on Clara Train SDK 4.0. To study the effect of data heterogeneity, training data was pooled from 3 systems locally and federation was simulated. We compared a centralized/pooled model, versus FedAvg, and 3 personalized FL variations (FedProx, FedBN, and FedAMP). RESULTS: We observed comparable model performance with respect to internal validation (local model: AUROC 0.94 vs FedAvg: 0.95, P = .5) and improved model generalizability with the FedAvg model (P < .05). When investigating the effects of model heterogeneity, we observed poor performance with FedAvg on internal validation as compared to personalized FL algorithms. FedAvg did have improved generalizability compared to personalized FL algorithms. On average, FedBN had the best rank performance on internal and external validation. CONCLUSION: FedAvg can significantly improve the generalization of the model compared to other personalization FL algorithms; however, at the cost of poor internal validity. Personalized FL may offer an opportunity to develop both internal and externally validated algorithms. Oxford University Press 2022-10-10 /pmc/articles/PMC9619688/ /pubmed/36214629 http://dx.doi.org/10.1093/jamia/ocac188 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com https://academic.oup.com/pages/standard-publication-reuse-rightsThis article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/pages/standard-publication-reuse-rights)
spellingShingle Research and Applications
Peng, Le
Luo, Gaoxiang
Walker, Andrew
Zaiman, Zachary
Jones, Emma K
Gupta, Hemant
Kersten, Kristopher
Burns, John L
Harle, Christopher A
Magoc, Tanja
Shickel, Benjamin
Steenburg, Scott D
Loftus, Tyler
Melton, Genevieve B
Gichoya, Judy Wawira
Sun, Ju
Tignanelli, Christopher J
Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
title Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
title_full Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
title_fullStr Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
title_full_unstemmed Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
title_short Evaluation of federated learning variations for COVID-19 diagnosis using chest radiographs from 42 US and European hospitals
title_sort evaluation of federated learning variations for covid-19 diagnosis using chest radiographs from 42 us and european hospitals
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619688/
https://www.ncbi.nlm.nih.gov/pubmed/36214629
http://dx.doi.org/10.1093/jamia/ocac188
work_keys_str_mv AT pengle evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT luogaoxiang evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT walkerandrew evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT zaimanzachary evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT jonesemmak evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT guptahemant evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT kerstenkristopher evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT burnsjohnl evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT harlechristophera evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT magoctanja evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT shickelbenjamin evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT steenburgscottd evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT loftustyler evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT meltongenevieveb evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT gichoyajudywawira evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT sunju evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals
AT tignanellichristopherj evaluationoffederatedlearningvariationsforcovid19diagnosisusingchestradiographsfrom42usandeuropeanhospitals