Cargando…

An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios

Federated learning is a distributed machine learning framework that enables distributed nodes with computation and storage capabilities to train a global model while keeping distributed-stored data locally. This process can promote the efficiency of modeling while preserving data privacy. Therefore,...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Aiguo, Fu, Yang, Sha, Zexin, Lu, Guoming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9218865/
https://www.ncbi.nlm.nih.gov/pubmed/35755701
http://dx.doi.org/10.3389/fpls.2022.908814
_version_ 1784731988641447936
author Chen, Aiguo
Fu, Yang
Sha, Zexin
Lu, Guoming
author_facet Chen, Aiguo
Fu, Yang
Sha, Zexin
Lu, Guoming
author_sort Chen, Aiguo
collection PubMed
description Federated learning is a distributed machine learning framework that enables distributed nodes with computation and storage capabilities to train a global model while keeping distributed-stored data locally. This process can promote the efficiency of modeling while preserving data privacy. Therefore, federated learning can be widely applied in distributed conjoint analysis scenarios, such as smart plant protection systems, in which widely networked IoT devices are used to monitor the critical data of plant production to improve crop production. However, the data collected by different IoT devices can be dependent and identically distributed (non-IID), causing the challenge of statistical heterogeneity. Studies have also shown that statistical heterogeneity can lead to a marked decline in the efficiency of federated learning, making it challenging to apply in practice. To promote the efficiency of federated learning in statistical heterogeneity scenarios, an adaptive client selection algorithm for federated learning in statistical heterogeneous scenarios called ACSFed is proposed in this paper. ACSFed can dynamically calculate the possibility of clients being selected to train the model for each communication round based on their local statistical heterogeneity and previous training performance instead of randomly selected clients, and clients with heavier statistical heterogeneity or bad training performance would be more likely selected to participate in the later training. This client selection strategy can enable the federated model to learn the global statistical knowledge faster and thereby promote the convergence of the federated model. Multiple experiments on public benchmark datasets demonstrate these improvements in the efficiency of the models in heterogeneous settings.
format Online
Article
Text
id pubmed-9218865
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92188652022-06-24 An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios Chen, Aiguo Fu, Yang Sha, Zexin Lu, Guoming Front Plant Sci Plant Science Federated learning is a distributed machine learning framework that enables distributed nodes with computation and storage capabilities to train a global model while keeping distributed-stored data locally. This process can promote the efficiency of modeling while preserving data privacy. Therefore, federated learning can be widely applied in distributed conjoint analysis scenarios, such as smart plant protection systems, in which widely networked IoT devices are used to monitor the critical data of plant production to improve crop production. However, the data collected by different IoT devices can be dependent and identically distributed (non-IID), causing the challenge of statistical heterogeneity. Studies have also shown that statistical heterogeneity can lead to a marked decline in the efficiency of federated learning, making it challenging to apply in practice. To promote the efficiency of federated learning in statistical heterogeneity scenarios, an adaptive client selection algorithm for federated learning in statistical heterogeneous scenarios called ACSFed is proposed in this paper. ACSFed can dynamically calculate the possibility of clients being selected to train the model for each communication round based on their local statistical heterogeneity and previous training performance instead of randomly selected clients, and clients with heavier statistical heterogeneity or bad training performance would be more likely selected to participate in the later training. This client selection strategy can enable the federated model to learn the global statistical knowledge faster and thereby promote the convergence of the federated model. Multiple experiments on public benchmark datasets demonstrate these improvements in the efficiency of the models in heterogeneous settings. Frontiers Media S.A. 2022-06-09 /pmc/articles/PMC9218865/ /pubmed/35755701 http://dx.doi.org/10.3389/fpls.2022.908814 Text en Copyright © 2022 Chen, Fu, Sha and Lu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Chen, Aiguo
Fu, Yang
Sha, Zexin
Lu, Guoming
An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios
title An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios
title_full An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios
title_fullStr An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios
title_full_unstemmed An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios
title_short An EMD-Based Adaptive Client Selection Algorithm for Federated Learning in Heterogeneous Data Scenarios
title_sort emd-based adaptive client selection algorithm for federated learning in heterogeneous data scenarios
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9218865/
https://www.ncbi.nlm.nih.gov/pubmed/35755701
http://dx.doi.org/10.3389/fpls.2022.908814
work_keys_str_mv AT chenaiguo anemdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT fuyang anemdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT shazexin anemdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT luguoming anemdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT chenaiguo emdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT fuyang emdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT shazexin emdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios
AT luguoming emdbasedadaptiveclientselectionalgorithmforfederatedlearninginheterogeneousdatascenarios