Cargando…

Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation

BACKGROUND: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive inf...

Descripción completa

Detalles Bibliográficos
Autores principales:	Shao, Rulin, He, Hongyu, Chen, Ziwei, Liu, Hui, Liu, Dianbo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2020
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909896/ https://www.ncbi.nlm.nih.gov/pubmed/33350391 http://dx.doi.org/10.2196/17265

_version_	1783656020999929856
author	Shao, Rulin He, Hongyu Chen, Ziwei Liu, Hui Liu, Dianbo
author_facet	Shao, Rulin He, Hongyu Chen, Ziwei Liu, Hui Liu, Dianbo
author_sort	Shao, Rulin
collection	PubMed
description	BACKGROUND: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive information during both the training and using processes. OBJECTIVE: To address security and privacy issues, we propose a privacy-preserving method for the analysis of distributed medical data. The proposed method, termed stochastic channel-based federated learning (SCBFL), enables participants to train a high-performance model cooperatively and in a distributed manner without sharing their inputs. METHODS: We designed, implemented, and evaluated a channel-based update algorithm for a central server in a distributed system. The update algorithm will select the channels with regard to the most active features in a training loop, and then upload them as learned information from local datasets. A pruning process, which serves as a model accelerator, was further applied to the algorithm based on the validation set. RESULTS: We constructed a distributed system consisting of 5 clients and 1 server. Our trials showed that the SCBFL method can achieve an area under the receiver operating characteristic curve (AUC-ROC) of 0.9776 and an area under the precision-recall curve (AUC-PR) of 0.9695 with only 10% of channels shared with the server. Compared with the federated averaging algorithm, the proposed SCBFL method achieved a 0.05388 higher AUC-ROC and 0.09695 higher AUC-PR. In addition, our experiment showed that 57% of the time is saved by the pruning process with only a reduction of 0.0047 in AUC-ROC performance and a reduction of 0.0068 in AUC-PR performance. CONCLUSIONS: In this experiment, our model demonstrated better performance and a higher saturating speed than the federated averaging method, which reveals all of the parameters of local models to the server. The saturation rate of performance could be promoted by introducing a pruning process and further improvement could be achieved by tuning the pruning rate.
format	Online Article Text
id	pubmed-7909896
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-79098962021-03-04 Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation Shao, Rulin He, Hongyu Chen, Ziwei Liu, Hui Liu, Dianbo JMIR Form Res Original Paper BACKGROUND: Artificial neural networks have achieved unprecedented success in the medical domain. This success depends on the availability of massive and representative datasets. However, data collection is often prevented by privacy concerns, and people want to take control over their sensitive information during both the training and using processes. OBJECTIVE: To address security and privacy issues, we propose a privacy-preserving method for the analysis of distributed medical data. The proposed method, termed stochastic channel-based federated learning (SCBFL), enables participants to train a high-performance model cooperatively and in a distributed manner without sharing their inputs. METHODS: We designed, implemented, and evaluated a channel-based update algorithm for a central server in a distributed system. The update algorithm will select the channels with regard to the most active features in a training loop, and then upload them as learned information from local datasets. A pruning process, which serves as a model accelerator, was further applied to the algorithm based on the validation set. RESULTS: We constructed a distributed system consisting of 5 clients and 1 server. Our trials showed that the SCBFL method can achieve an area under the receiver operating characteristic curve (AUC-ROC) of 0.9776 and an area under the precision-recall curve (AUC-PR) of 0.9695 with only 10% of channels shared with the server. Compared with the federated averaging algorithm, the proposed SCBFL method achieved a 0.05388 higher AUC-ROC and 0.09695 higher AUC-PR. In addition, our experiment showed that 57% of the time is saved by the pruning process with only a reduction of 0.0047 in AUC-ROC performance and a reduction of 0.0068 in AUC-PR performance. CONCLUSIONS: In this experiment, our model demonstrated better performance and a higher saturating speed than the federated averaging method, which reveals all of the parameters of local models to the server. The saturation rate of performance could be promoted by introducing a pruning process and further improvement could be achieved by tuning the pruning rate. JMIR Publications 2020-12-22 /pmc/articles/PMC7909896/ /pubmed/33350391 http://dx.doi.org/10.2196/17265 Text en ©Rulin Shao, Hongyu He, Ziwei Chen, Hui Liu, Dianbo Liu. Originally published in JMIR Formative Research (http://formative.jmir.org), 22.12.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on http://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle	Original Paper Shao, Rulin He, Hongyu Chen, Ziwei Liu, Hui Liu, Dianbo Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title	Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_full	Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_fullStr	Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_full_unstemmed	Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_short	Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation
title_sort	stochastic channel-based federated learning with neural network pruning for medical data privacy preservation: model development and experimental validation
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7909896/ https://www.ncbi.nlm.nih.gov/pubmed/33350391 http://dx.doi.org/10.2196/17265
work_keys_str_mv	AT shaorulin stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation AT hehongyu stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation AT chenziwei stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation AT liuhui stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation AT liudianbo stochasticchannelbasedfederatedlearningwithneuralnetworkpruningformedicaldataprivacypreservationmodeldevelopmentandexperimentalvalidation

Stochastic Channel-Based Federated Learning With Neural Network Pruning for Medical Data Privacy Preservation: Model Development and Experimental Validation

Ejemplares similares