Cargando…

Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data

Due to the distributed data collection and learning in federated learnings, many clients conduct local training with non-independent and identically distributed (non-IID) datasets. Accordingly, the training from these datasets results in severe performance degradation. We propose an efficient algori...

Descripción completa

Detalles Bibliográficos
Autores principales:	Seol, Mihye, Kim, Taejoon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919903/ https://www.ncbi.nlm.nih.gov/pubmed/36772192 http://dx.doi.org/10.3390/s23031152

_version_	1784886938548830208
author	Seol, Mihye Kim, Taejoon
author_facet	Seol, Mihye Kim, Taejoon
author_sort	Seol, Mihye
collection	PubMed
description	Due to the distributed data collection and learning in federated learnings, many clients conduct local training with non-independent and identically distributed (non-IID) datasets. Accordingly, the training from these datasets results in severe performance degradation. We propose an efficient algorithm for enhancing the performance of federated learning by overcoming the negative effects of non-IID datasets. First, the intra-client class imbalance is reduced by rendering the class distribution of clients close to Uniform distribution. Second, the clients to participate in federated learning are selected to make their integrated class distribution close to Uniform distribution for the purpose of mitigating the inter-client class imbalance, which represents the class distribution difference among clients. In addition, the amount of local training data for the selected clients is finely adjusted. Finally, in order to increase the efficiency of federated learning, the batch size and the learning rate of local training for the selected clients are dynamically controlled reflecting the effective size of the local dataset for each client. In the performance evaluation on CIFAR-10 and MNIST datasets, the proposed algorithm achieves 20% higher accuracy than existing federated learning algorithms. Moreover, in achieving this huge accuracy improvement, the proposed algorithm uses less computation and communication resources compared to existing algorithms in terms of the amount of data used and the number of clients joined in the training.
format	Online Article Text
id	pubmed-9919903
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-99199032023-02-12 Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data Seol, Mihye Kim, Taejoon Sensors (Basel) Article Due to the distributed data collection and learning in federated learnings, many clients conduct local training with non-independent and identically distributed (non-IID) datasets. Accordingly, the training from these datasets results in severe performance degradation. We propose an efficient algorithm for enhancing the performance of federated learning by overcoming the negative effects of non-IID datasets. First, the intra-client class imbalance is reduced by rendering the class distribution of clients close to Uniform distribution. Second, the clients to participate in federated learning are selected to make their integrated class distribution close to Uniform distribution for the purpose of mitigating the inter-client class imbalance, which represents the class distribution difference among clients. In addition, the amount of local training data for the selected clients is finely adjusted. Finally, in order to increase the efficiency of federated learning, the batch size and the learning rate of local training for the selected clients are dynamically controlled reflecting the effective size of the local dataset for each client. In the performance evaluation on CIFAR-10 and MNIST datasets, the proposed algorithm achieves 20% higher accuracy than existing federated learning algorithms. Moreover, in achieving this huge accuracy improvement, the proposed algorithm uses less computation and communication resources compared to existing algorithms in terms of the amount of data used and the number of clients joined in the training. MDPI 2023-01-19 /pmc/articles/PMC9919903/ /pubmed/36772192 http://dx.doi.org/10.3390/s23031152 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Seol, Mihye Kim, Taejoon Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
title	Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
title_full	Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
title_fullStr	Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
title_full_unstemmed	Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
title_short	Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data
title_sort	performance enhancement in federated learning by reducing class imbalance of non-iid data
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919903/ https://www.ncbi.nlm.nih.gov/pubmed/36772192 http://dx.doi.org/10.3390/s23031152
work_keys_str_mv	AT seolmihye performanceenhancementinfederatedlearningbyreducingclassimbalanceofnoniiddata AT kimtaejoon performanceenhancementinfederatedlearningbyreducingclassimbalanceofnoniiddata

Performance Enhancement in Federated Learning by Reducing Class Imbalance of Non-IID Data

Ejemplares similares