Cargando…

Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation

BACKGROUND: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive inf...

Descripción completa

Detalles Bibliográficos
Autores principales: Shao, Rulin, He, Hongyu, Chen, Ziwei, Liu, Hui, Liu, Dianbo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909896/
https://www.ncbi.nlm.nih.gov/pubmed/33350391
http://dx.doi.org/10.2196/17265
_version_ 1783656020999929856
author Shao, Rulin
He, Hongyu
Chen, Ziwei
Liu, Hui
Liu, Dianbo
author_facet Shao, Rulin
He, Hongyu
Chen, Ziwei
Liu, Hui
Liu, Dianbo
author_sort Shao, Rulin
collection PubMed
description BACKGROUND: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive information during both the training and using processes. OBJECTIVE: To address security and privacy issues, we propose a privacy-preserving method for the analysis of distributed medical data. The proposed method, termed stochastic channel-based federated learning (SCBFL), enables participants to train a high-performance model cooperatively and in a distributed manner without sharing their inputs. METHODS: We designed, implemented, and evaluated a channel-based update algorithm for a central server in a distributed system. The update algorithm will select the channels with regard to the most active features in a training loop, and then upload them as learned information from local datasets. A pruning process, which serves as a model accelerator, was further applied to the algorithm based on the validation set. RESULTS: We constructed a distributed system consisting of 5 clients and 1 server. Our trials showed that the SCBFL method can achieve an area under the receiver operating characteristic curve (AUC-ROC) of 0.9776 and an area under the precision-recall curve (AUC-PR) of 0.9695 with only 10% of channels shared with the server. Compared with the federated averaging algorithm, the proposed SCBFL method achieved a 0.05388 higher AUC-ROC and 0.09695 higher AUC-PR. In addition, our experiment showed that 57% of the time is saved by the pruning process with only a reduction of 0.0047 in AUC-ROC performance and a reduction of 0.0068 in AUC-PR performance. CONCLUSIONS: In this experiment, our model demonstrated better performance and a higher saturating speed than the federated averaging method, which reveals all of the parameters of local models to the server. The saturation rate of performance could be promoted by introducing a pruning process and further improvement could be achieved by tuning the pruning rate.
format Online
Article
Text
id pubmed-7909896
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-79098962021-03-04 Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation Shao, Rulin He, Hongyu Chen, Ziwei Liu, Hui Liu, Dianbo JMIR Form Res Original Paper BACKGROUND: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive information during both the training and using processes. OBJECTIVE: To address security and privacy issues, we propose a privacy-preserving method for the analysis of distributed medical data. The proposed method, termed stochastic channel-based federated learning (SCBFL), enables participants to train a high-performance model cooperatively and in a distributed manner without sharing their inputs. METHODS: We designed, implemented, and evaluated a channel-based update algorithm for a central server in a distributed system. The update algorithm will select the channels with regard to the most active features in a training loop, and then upload them as learned information from local datasets. A pruning process, which serves as a model accelerator, was further applied to the algorithm based on the validation set. RESULTS: We constructed a distributed system consisting of 5 clients and 1 server. Our trials showed that the SCBFL method can achieve an area under the receiver operating characteristic curve (AUC-ROC) of 0.9776 and an area under the precision-recall curve (AUC-PR) of 0.9695 with only 10% of channels shared with the server. Compared with the federated averaging algorithm, the proposed SCBFL method achieved a 0.05388 higher AUC-ROC and 0.09695 higher AUC-PR. In addition, our experiment showed that 57% of the time is saved by the pruning process with only a reduction of 0.0047 in AUC-ROC performance and a reduction of 0.0068 in AUC-PR performance. CONCLUSIONS: In this experiment, our model demonstrated better performance and a higher saturating speed than the federated averaging method, which reveals all of the parameters of local models to the server. The saturation rate of performance could be promoted by introducing a pruning process and further improvement could be achieved by tuning the pruning rate. JMIR Publications 2020-12-22 /pmc/articles/PMC7909896/ /pubmed/33350391 http://dx.doi.org/10.2196/17265 Text en ©Rulin Shao, Hongyu He, Ziwei Chen, Hui Liu, Dianbo Liu. Originally published in JMIR Formative Research (http://formative.jmir.org), 22.12.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on http://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Shao, Rulin
He, Hongyu
Chen, Ziwei
Liu, Hui
Liu, Dianbo
Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_full Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_fullStr Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_full_unstemmed Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_short Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_sort stochastic channel-based federated learning with neural network pruning for medical data privacy preservation: model development and experimental validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909896/
https://www.ncbi.nlm.nih.gov/pubmed/33350391
http://dx.doi.org/10.2196/17265
work_keys_str_mv AT shaorulin stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation
AT hehongyu stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation
AT chenziwei stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation
AT liuhui stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation
AT liudianbo stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation