Abstract:Asynchronous Federated Learning (AFL) faces inherent challenges arising from device heterogeneity (e.g., differing computation capacities) and low-bandwidth environments, both of which can cause stale model updates (e.g., local gradients) at global aggregation. Traditional approaches for mitigating update staleness typically focus on either adjusting the local update frequency or compressing gradients, but not both. Recognizing this gap, we introduce a novel approach that synergizes local updating with gradient compression. Our research begins by examining the interplay between local update frequency and gradient compression rate, and their collective impact on convergence speed. The theoretical upper bound shows that the local update frequency and gradient compression rate of each device are jointly determined by its computing power, communication capability, and other factors. Building on this foundation, we propose an AFL framework called FedLuck that adaptively optimizes both the local update frequency and the gradient compression rate. Experiments on image classification and speech recognition show that FedLuck reduces communication consumption by 56% and training time by 55% on average, achieving competitive performance in heterogeneous and low-bandwidth scenarios compared to the baselines.
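As context for the combination of local updating and gradient compression described above, here is a minimal sketch of one asynchronous device round that performs several local SGD steps and then top-k sparsifies the accumulated update before upload. The function names, the `grad_fn` interface, and the choice of top-k compression are illustrative assumptions; FedLuck's actual adaptive rule for choosing the local step count and compression rate is not reproduced here.

```python
import numpy as np

def topk_compress(update, rate):
    """Keep only the largest-magnitude entries; `rate` is the fraction retained."""
    flat = update.ravel()
    k = max(1, int(rate * flat.size))
    idx = np.argpartition(np.abs(flat), -k)[-k:]
    sparse = np.zeros_like(flat)
    sparse[idx] = flat[idx]
    return sparse.reshape(update.shape)

def local_round(global_params, data_batches, lr, local_steps, compress_rate, grad_fn):
    """One asynchronous device round (sketch): run `local_steps` SGD steps,
    then compress the accumulated update before sending it to the server."""
    params = global_params.copy()
    for step in range(local_steps):
        g = grad_fn(params, data_batches[step % len(data_batches)])
        params -= lr * g
    update = params - global_params  # accumulated local update
    return topk_compress(update, compress_rate)
```

In this reading, a slow or poorly connected device would be assigned fewer local steps and a lower retention rate so that its (compressed) update arrives before it becomes too stale.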
Abstract:We investigate the problem of compound estimation of normal means while accounting for the presence of side information. Leveraging the empirical Bayes framework, we develop a nonparametric integrative Tweedie (NIT) approach that incorporates structural knowledge encoded in multivariate auxiliary data to enhance the precision of compound estimation. Our approach employs convex optimization tools to estimate the gradient of the log-density directly, enabling the incorporation of structural constraints. We conduct theoretical analyses of the asymptotic risk of NIT and establish the rate at which NIT converges to the oracle estimator. As the dimension of the auxiliary data increases, we accurately quantify the improvements in estimation risk and the associated deterioration in convergence rate. The numerical performance of NIT is illustrated through the analysis of both simulated and real data, demonstrating its superiority over existing methods.
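For reference, the classical Tweedie formula underlying this line of work is stated below, together with an integrative variant that conditions on auxiliary data; the second display is our reading of the approach described above (the posterior-mean correction uses the gradient of the log marginal density, estimated jointly with the side information), not a verbatim statement from the paper.

```latex
% Classical Tweedie formula: for X_i | theta_i ~ N(theta_i, sigma^2) with
% marginal density f, the posterior mean is the observation plus a score correction.
\[
  \mathbb{E}[\theta_i \mid X_i = x] \;=\; x + \sigma^2 \, \frac{d}{dx}\log f(x).
\]
% Integrative variant (our reading): condition on auxiliary data S_i = s, so the
% correction uses the x-gradient of the log joint/conditional marginal density.
\[
  \mathbb{E}[\theta_i \mid X_i = x,\, S_i = s] \;=\; x + \sigma^2 \, \partial_x \log f(x, s).
\]
```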
Abstract:Federated learning (FL) has emerged as an efficient and privacy-preserving scheme for distributed learning. In this work, we focus on optimizing computation and communication in FL from the perspective of pruning. By adopting layer-wise pruning in local training and federated updating, we formulate an explicit FL pruning framework, FedLP (Federated Layer-wise Pruning), which is model-agnostic and applicable to different types of deep learning models. Two specific FedLP schemes are designed for scenarios with homogeneous and heterogeneous local models, respectively. Both theoretical and experimental evaluations verify that FedLP relieves the system bottlenecks of communication and computation with only marginal performance decay. To the best of our knowledge, FedLP is the first framework that formally introduces layer-wise pruning into FL. Within the scope of federated learning, further variants and combinations can be designed on top of FedLP.
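To make the layer-wise idea concrete, the sketch below shows one plausible instantiation: each client uploads only a random subset of layers, and the server averages each layer over the clients that actually uploaded it. The keep-probability rule and function names are assumptions for illustration; FedLP's actual selection schemes for the homogeneous and heterogeneous cases are not reproduced here.

```python
import random
import numpy as np

def prune_layers(state_dict, keep_prob):
    """Client-side layer-wise pruning (sketch): each layer is kept (uploaded)
    independently with probability `keep_prob`; the rest are dropped locally."""
    return {name: w for name, w in state_dict.items() if random.random() < keep_prob}

def aggregate(client_updates, global_state):
    """Server-side aggregation (sketch): average each layer only over the clients
    that uploaded it; layers nobody uploaded keep their current global weights."""
    new_state = dict(global_state)
    for name in global_state:
        uploads = [u[name] for u in client_updates if name in u]
        if uploads:
            new_state[name] = np.mean(uploads, axis=0)
    return new_state
```

Because each client transmits and trains only a fraction of the layers, both the per-round communication volume and the local computation shrink roughly in proportion to the keep probability.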
Abstract:We consider the problem of autonomous channel access (AutoCA), where a group of terminals tries to discover a communication strategy with an access point (AP) over a common wireless channel in a distributed fashion. Due to the irregular topology and the limited communication range of terminals, a practical challenge for AutoCA is the hidden terminal problem, which is notorious in wireless networks for degrading throughput and delay performance. To address this challenge, this paper presents a new multi-agent deep reinforcement learning paradigm, dubbed MADRL-HT, tailored for AutoCA in the presence of hidden terminals. MADRL-HT exploits topological insights and transforms the observation space of each terminal into a scalable form independent of the number of terminals. To compensate for partial observability, we put forth a look-back mechanism such that terminals can infer the behaviors of their hidden terminals from the carrier-sensed channel states as well as feedback from the AP. A window-based global reward function is proposed, whereby terminals are guided to maximize the system throughput while balancing their transmission opportunities over the course of learning. Extensive numerical experiments verify the superior performance of our solution benchmarked against the legacy carrier-sense multiple access with collision avoidance (CSMA/CA) protocol.
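The sketch below illustrates one plausible form of a window-based global reward that trades off system throughput against fairness across terminals. The specific functional form (a weighted sum with Jain's fairness index) and the parameter `alpha` are assumptions for illustration; the abstract does not specify the exact reward used by MADRL-HT.

```python
import numpy as np

def window_reward(success_log, alpha=0.5):
    """Window-based global reward (assumed form, not from the paper):
    `success_log` is a (W x N) 0/1 array of successful transmissions over the
    last W slots for N terminals. Combines windowed throughput with Jain's
    fairness index over per-terminal success shares."""
    per_terminal = success_log.sum(axis=0)                 # successes per terminal
    total = per_terminal.sum()
    if total == 0:
        return 0.0
    throughput = total / success_log.shape[0]              # successes per slot
    fairness = total**2 / (len(per_terminal) * np.sum(per_terminal**2))  # Jain's index
    return alpha * throughput + (1 - alpha) * fairness
```

A shared reward of this kind pushes all agents toward high aggregate throughput while penalizing policies that let a few well-placed terminals monopolize the channel.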