Cargando…

Network Coding Approaches for Distributed Computation over Lossy Wireless Networks

In wireless distributed computing systems, worker nodes connect to a master node wirelessly and perform large-scale computational tasks that are parallelized across them. However, the common phenomenon of straggling (i.e., worker nodes often experience unpredictable slowdown during computation and c...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Bin, Tang, Bin, Qu, Zhihao, Ye, Baoliu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047255/
https://www.ncbi.nlm.nih.gov/pubmed/36981318
http://dx.doi.org/10.3390/e25030428
_version_ 1785013875067846656
author Fan, Bin
Tang, Bin
Qu, Zhihao
Ye, Baoliu
author_facet Fan, Bin
Tang, Bin
Qu, Zhihao
Ye, Baoliu
author_sort Fan, Bin
collection PubMed
description In wireless distributed computing systems, worker nodes connect to a master node wirelessly and perform large-scale computational tasks that are parallelized across them. However, the common phenomenon of straggling (i.e., worker nodes often experience unpredictable slowdown during computation and communication) and packet losses due to severe channel fading can significantly increase the latency of computational tasks. In this paper, we consider a heterogeneous, wireless, distributed computing system performing large-scale matrix multiplications which form the core of many machine learning applications. To address the aforementioned challenges, we first propose a random linear network coding (RLNC) approach that leverages the linearity of matrix multiplication, which has many salient properties, including ratelessness, maximum straggler tolerance and near-ideal load balancing. We then theoretically demonstrate that its latency converges to the optimum in probability when the matrix size grows to infinity. To combat the high encoding and decoding overheads of the RLNC approach, we further propose a practical variation based on batched sparse (BATS) code. The effectiveness of our proposed approaches is demonstrated by numerical simulations.
format Online
Article
Text
id pubmed-10047255
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100472552023-03-29 Network Coding Approaches for Distributed Computation over Lossy Wireless Networks Fan, Bin Tang, Bin Qu, Zhihao Ye, Baoliu Entropy (Basel) Article In wireless distributed computing systems, worker nodes connect to a master node wirelessly and perform large-scale computational tasks that are parallelized across them. However, the common phenomenon of straggling (i.e., worker nodes often experience unpredictable slowdown during computation and communication) and packet losses due to severe channel fading can significantly increase the latency of computational tasks. In this paper, we consider a heterogeneous, wireless, distributed computing system performing large-scale matrix multiplications which form the core of many machine learning applications. To address the aforementioned challenges, we first propose a random linear network coding (RLNC) approach that leverages the linearity of matrix multiplication, which has many salient properties, including ratelessness, maximum straggler tolerance and near-ideal load balancing. We then theoretically demonstrate that its latency converges to the optimum in probability when the matrix size grows to infinity. To combat the high encoding and decoding overheads of the RLNC approach, we further propose a practical variation based on batched sparse (BATS) code. The effectiveness of our proposed approaches is demonstrated by numerical simulations. MDPI 2023-02-27 /pmc/articles/PMC10047255/ /pubmed/36981318 http://dx.doi.org/10.3390/e25030428 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fan, Bin
Tang, Bin
Qu, Zhihao
Ye, Baoliu
Network Coding Approaches for Distributed Computation over Lossy Wireless Networks
title Network Coding Approaches for Distributed Computation over Lossy Wireless Networks
title_full Network Coding Approaches for Distributed Computation over Lossy Wireless Networks
title_fullStr Network Coding Approaches for Distributed Computation over Lossy Wireless Networks
title_full_unstemmed Network Coding Approaches for Distributed Computation over Lossy Wireless Networks
title_short Network Coding Approaches for Distributed Computation over Lossy Wireless Networks
title_sort network coding approaches for distributed computation over lossy wireless networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10047255/
https://www.ncbi.nlm.nih.gov/pubmed/36981318
http://dx.doi.org/10.3390/e25030428
work_keys_str_mv AT fanbin networkcodingapproachesfordistributedcomputationoverlossywirelessnetworks
AT tangbin networkcodingapproachesfordistributedcomputationoverlossywirelessnetworks
AT quzhihao networkcodingapproachesfordistributedcomputationoverlossywirelessnetworks
AT yebaoliu networkcodingapproachesfordistributedcomputationoverlossywirelessnetworks