Jianwei Huang

Adaptive Heterogeneous Client Sampling for Federated Learning over Wireless Networks

Apr 22, 2024

Bing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas

Abstract: Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limited. Recent works on the convergence analysis of FL have focused on unbiased client sampling, e.g., sampling uniformly at random, which suffers from slow wall-clock time for convergence due to high degrees of system heterogeneity and statistical heterogeneity. This paper aims to design an adaptive client sampling algorithm for FL over wireless networks that tackles both system and statistical heterogeneity to minimize the wall-clock convergence time. We obtain a new tractable convergence bound for FL algorithms with arbitrary client sampling probabilities. Based on the bound, we analytically establish the relationship between the total learning time and the sampling probabilities under an adaptive bandwidth allocation scheme, which results in a non-convex optimization problem. We design an efficient algorithm for learning the unknown parameters in the convergence bound and develop a low-complexity algorithm to approximately solve the non-convex problem. Our solution reveals the impact of system and statistical heterogeneity parameters on the optimal client sampling design. Moreover, our solution shows that as the number of sampled clients increases, the total convergence time first decreases and then increases, because a larger sampling number reduces the number of rounds for convergence but results in a longer expected time per round due to limited wireless bandwidth. Experimental results from both hardware prototype and simulation demonstrate that our proposed sampling scheme significantly reduces the convergence time compared to several baseline sampling schemes.

* Published in IEEE Transactions on Mobile Computing (TMC). arXiv admin note: substantial text overlap with arXiv:2112.11256
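
The abstract above refers to sampling clients with arbitrary (non-uniform) probabilities. One common way to keep the aggregated update unbiased under such sampling is inverse-probability weighting; the sketch below illustrates that general idea only and is not taken from the paper. The names `sample_clients` and `aggregate`, the data weights `p`, the sampling probabilities `q`, and the scalar updates are all assumptions made for illustration.

```python
import random


def sample_clients(q, k, rng):
    """Draw k client indices with replacement according to probabilities q."""
    return rng.choices(range(len(q)), weights=q, k=k)


def aggregate(updates, p, q, sampled):
    """Inverse-probability-weighted sum of the sampled clients' updates.

    Each sampled update is scaled by p[i] / (k * q[i]), so the expectation
    over the sampling equals the full-participation value
    sum_i p[i] * updates[i] (assumes every q[i] > 0).
    """
    k = len(sampled)
    return sum(p[i] / (k * q[i]) * updates[i] for i in sampled)


def full_participation(updates, p):
    """Reference value obtained when every client participates."""
    return sum(pi * ui for pi, ui in zip(p, updates))
```

Averaging `aggregate(...)` over many independent draws from `sample_clients` should approach `full_participation(updates, p)` for any strictly positive `q`, which is what makes the choice of `q` a free design variable in this toy setting.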

Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization

Dec 21, 2023

Pengchao Han, Shiqiang Wang, Yang Jiao, Jianwei Huang

Abstract: While providing a machine learning model as a service to process users' inference requests, online applications can periodically upgrade the model utilizing newly collected data. Federated learning (FL) is beneficial for enabling the training of models across distributed clients while keeping the data local. However, existing work has overlooked the coexistence of model training and inference under clients' limited resources. This paper focuses on the joint optimization of model training and inference to maximize inference performance at clients. Such an optimization faces several challenges. The first challenge is to characterize the clients' inference performance when clients may partially participate in FL. To resolve this challenge, we introduce a new notion of age of model (AoM) to quantify client-side model freshness, based on which we use FL's global model convergence error as an approximate measure of inference performance. The second challenge is the tight coupling among clients' decisions, including participation probability in FL, model download probability, and service rates. To address these challenges, we propose an online problem approximation to reduce the problem complexity and optimize the resources to balance the needs of model training and inference. Experimental results demonstrate that the proposed algorithm improves the average inference accuracy by up to 12%.

* Accepted by IEEE International Conference on Computer Communications (INFOCOM) 2024

Provably Convergent Federated Trilevel Learning

Dec 19, 2023

Yang Jiao, Kai Yang, Tiancheng Wu, Chengtao Jian, Jianwei Huang

Abstract: Trilevel learning, also called trilevel optimization (TLO), has been recognized as a powerful modelling tool for hierarchical decision processes and is widely applied in many machine learning applications, such as robust neural architecture search, hyperparameter optimization, and domain adaptation. Tackling TLO problems has presented a great challenge due to their nested decision-making structure. In addition, existing works on TLO face the following key challenges: 1) they all focus on the non-distributed setting, which may lead to privacy breaches; 2) they do not offer any non-asymptotic convergence analysis that characterizes how fast an algorithm converges. To address the aforementioned challenges, this paper proposes an asynchronous federated trilevel optimization method to solve TLO problems. The proposed method utilizes $\mu$-cuts to construct a hyper-polyhedral approximation for the TLO problem and solves it in an asynchronous manner. We demonstrate that the proposed $\mu$-cuts are applicable to not only convex functions but also a wide range of non-convex functions that meet the $\mu$-weakly convex assumption. Furthermore, we theoretically analyze the non-asymptotic convergence rate for the proposed method by showing that its iteration complexity to obtain an $\epsilon$-stationary point is upper bounded by $\mathcal{O}(\frac{1}{\epsilon^2})$. Extensive experiments on real-world datasets have been conducted to elucidate the superiority of the proposed method, e.g., it has a faster convergence rate with a maximum acceleration of approximately 80$\%$.

* Accepted at AAAI 2024

FedAL: Black-Box Federated Knowledge Distillation Enabled by Adversarial Learning

Nov 28, 2023

Pengchao Han, Xingyan Shi, Jianwei Huang

Abstract: Knowledge distillation (KD) can enable collaborative learning among distributed clients that have different model architectures and do not share their local data and model parameters with others. Each client updates its local model using the average model output/feature of all client models as the target, known as federated KD. However, existing federated KD methods often do not perform well when clients' local models are trained with heterogeneous local datasets. In this paper, we propose Federated knowledge distillation enabled by Adversarial Learning (FedAL) to address the data heterogeneity among clients. First, to alleviate the local model output divergence across clients caused by data heterogeneity, the server acts as a discriminator to guide clients' local model training to achieve consensus model outputs among clients through a min-max game between clients and the discriminator. Moreover, catastrophic forgetting may happen during the clients' local training and global knowledge transfer due to clients' heterogeneous local data. To address this challenge, we design a less-forgetting regularization for both local training and global knowledge transfer to guarantee clients' ability to transfer/learn knowledge to/from others. Experimental results show that FedAL and its variants achieve higher accuracy than other federated KD baselines.

Incentive Mechanism Design for Distributed Ensemble Learning

Oct 13, 2023

Chao Huang, Pengchao Han, Jianwei Huang

Abstract: Distributed ensemble learning (DEL) involves training multiple models at distributed learners, and then combining their predictions to improve performance. Existing related studies focus on DEL algorithm design and optimization but ignore the important issue of incentives, without which self-interested learners may be unwilling to participate in DEL. We aim to fill this gap by presenting a first study on incentive mechanism design for DEL. Our proposed mechanism specifies both the amount of training data and the reward for learners with heterogeneous computation and communication costs. One design challenge is to accurately understand how learners' diversity (in terms of training data) affects the ensemble accuracy. To this end, we decompose the ensemble accuracy into a diversity-precision tradeoff to guide the mechanism design. Another challenge is that the mechanism design involves solving a mixed-integer program with a large search space. To this end, we propose an alternating algorithm that iteratively updates each learner's training data size and reward. We prove that under mild conditions, the algorithm converges. Numerical results using the MNIST dataset show an interesting result: our proposed mechanism may prefer a lower level of learner diversity to achieve a higher ensemble accuracy.

* Accepted to IEEE GLOBECOM 2023
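
The abstract above describes DEL as training models at separate learners and then combining their predictions. It does not state the combination rule, so the sketch below assumes simple plurality voting over class labels purely for illustration; `majority_vote` and `ensemble_predict` are hypothetical helper names.

```python
from collections import Counter


def majority_vote(votes):
    """Return the label predicted by the most learners for one sample."""
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_learner_predictions):
    """Combine predictions where per_learner_predictions[m][n] is the
    label that learner m assigns to sample n."""
    return [majority_vote(votes) for votes in zip(*per_learner_predictions)]
```

With three learners, a sample is labelled correctly whenever at least two of them agree on the right class, which is why the mix of individual accuracy and disagreement among learners matters for the ensemble.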

Incentive Mechanism Design for Unbiased Federated Learning with Randomized Client Participation

Apr 17, 2023

Bing Luo, Yutong Feng, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas

Abstract: An incentive mechanism is crucial for federated learning (FL) when rational clients do not have the same interests in the global model as the server. However, due to system heterogeneity and limited budget, it is generally impractical for the server to incentivize all clients to participate in all training rounds (known as full participation). Existing FL incentive mechanisms are typically designed to stimulate a fixed subset of clients based on their data quantity or system resources. Hence, FL is performed using only this subset of clients throughout the entire training process, leading to a biased model because of data heterogeneity. This paper proposes a game-theoretic incentive mechanism for FL with randomized client participation, where the server adopts a customized pricing strategy that motivates different clients to join with different participation levels (probabilities) to obtain an unbiased and high-performance model. Each client responds to the server's monetary incentive by choosing its best participation level to maximize its profit, based on not only the incurred local cost but also its intrinsic value for the global model. To effectively evaluate clients' contribution to the model performance, we derive a new convergence bound that analytically predicts how clients' arbitrary participation levels and their heterogeneous data affect the model performance. By solving a non-convex optimization problem, our analysis reveals that the intrinsic value leads to the interesting possibility of bidirectional payment between the server and clients. Experimental results using real datasets on a hardware prototype demonstrate the superiority of our mechanism in achieving higher model performance for the server as well as higher profits for the clients.

* Accepted in ICDCS 2023

Optimization Design for Federated Learning in Heterogeneous 6G Networks

Mar 15, 2023

Bing Luo, Xiaomin Ouyang, Peng Sun, Pengchao Han, Ningning Ding, Jianwei Huang

Abstract: With the rapid advancement of 5G networks, billions of smart Internet of Things (IoT) devices, along with an enormous amount of data, are generated at the network edge. While it is still at an early stage, the evolving 6G network is expected to adopt advanced artificial intelligence (AI) technologies to collect, transmit, and learn from this valuable data for innovative applications and intelligent services. However, traditional machine learning (ML) approaches require centralizing the training data in the data center or cloud, raising serious user-privacy concerns. Federated learning (FL), as an emerging distributed AI paradigm with a privacy-preserving nature, is anticipated to be a key enabler for achieving ubiquitous AI in 6G networks. However, there are several system and statistical heterogeneity challenges for effective and efficient FL implementation in 6G networks. In this article, we investigate the optimization approaches that can effectively address the challenging heterogeneity issues from three aspects: incentive mechanism design, network resource management, and personalized model optimization. We also present some open problems and promising directions for future research.

* Accepted in IEEE Network

Cross-Silo Federated Learning: Challenges and Opportunities

Jun 26, 2022

Chao Huang, Jianwei Huang, Xin Liu

Abstract: Federated learning (FL) is an emerging technology that enables the training of machine learning models from multiple clients while keeping the data distributed and private. Based on the participating clients and the model training scale, federated learning can be classified into two types: cross-device FL, where clients are typically mobile devices and the number of clients can reach a scale of millions; and cross-silo FL, where clients are organizations or companies and the number of clients is usually small (e.g., within a hundred). While existing studies mainly focus on cross-device FL, this paper aims to provide an overview of cross-silo FL. More specifically, we first discuss applications of cross-silo FL and outline its major challenges. We then provide a systematic overview of the existing approaches to the challenges in cross-silo FL by focusing on their connections to and differences from cross-device FL. Finally, we discuss future directions and open issues that merit research efforts from the community.

Socially-Optimal Mechanism Design for Incentivized Online Learning

Dec 29, 2021

Zhiyuan Wang, Lin Gao, Jianwei Huang

Abstract: The multi-armed bandit (MAB) is a classic online learning framework that studies sequential decision-making in an uncertain environment. The MAB framework, however, overlooks the scenario where the decision-maker cannot take actions (e.g., pulling arms) directly. It is a practically important scenario in many applications such as spectrum sharing, crowdsensing, and edge computing. In these applications, the decision-maker would incentivize other selfish agents to carry out desired actions (i.e., pulling arms on the decision-maker's behalf). This paper establishes the incentivized online learning (IOL) framework for this scenario. The key challenge in designing the IOL framework lies in the tight coupling of the unknown environment learning and asymmetric information revelation. To address this, we construct a special Lagrangian function based on which we propose a socially-optimal mechanism for the IOL framework. Our mechanism satisfies various desirable properties such as agent fairness, incentive compatibility, and voluntary participation. It achieves the same asymptotic performance as the state-of-the-art benchmark that requires extra information. Our analysis also unveils the power of the crowd in the IOL framework: a larger agent crowd enables our mechanism to approach more closely the theoretical upper bound of social performance. Numerical results demonstrate the advantages of our mechanism in large-scale edge computing.

* IEEE INFOCOM 2022
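
For readers unfamiliar with the classic MAB setting that the abstract above starts from, here is a minimal sketch of the standard UCB1 rule with Bernoulli rewards, in which the decision-maker pulls arms directly. It is not the paper's incentivized mechanism; the function name `ucb1`, the Bernoulli reward model, and the parameters are assumptions chosen for illustration.

```python
import math
import random


def ucb1(arm_means, horizon, rng):
    """Run UCB1 on Bernoulli arms and return how often each arm was pulled."""
    n_arms = len(arm_means)
    counts = [0] * n_arms    # number of pulls per arm
    totals = [0.0] * n_arms  # cumulative reward per arm
    for t in range(1, horizon + 1):
        if t <= n_arms:
            arm = t - 1  # initial phase: pull every arm once
        else:
            # choose the arm with the largest upper confidence bound
            arm = max(
                range(n_arms),
                key=lambda a: totals[a] / counts[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward
    return counts
```

Calling `ucb1([0.1, 0.9], 2000, random.Random(0))` concentrates most pulls on the second arm, since the exploration bonus shrinks as an arm is sampled more often.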

Tackling System and Statistical Heterogeneity for Federated Learning with Adaptive Client Sampling

Dec 21, 2021

Bing Luo, Wenli Xiao, Shiqiang Wang, Jianwei Huang, Leandros Tassiulas

Abstract: Federated learning (FL) algorithms usually sample a fraction of clients in each round (partial participation) when the number of participants is large and the server's communication bandwidth is limited. Recent works on the convergence analysis of FL have focused on unbiased client sampling, e.g., sampling uniformly at random, which suffers from slow wall-clock time for convergence due to high degrees of system heterogeneity and statistical heterogeneity. This paper aims to design an adaptive client sampling algorithm that tackles both system and statistical heterogeneity to minimize the wall-clock convergence time. We obtain a new tractable convergence bound for FL algorithms with arbitrary client sampling probabilities. Based on the bound, we analytically establish the relationship between the total learning time and sampling probabilities, which results in a non-convex optimization problem for training time minimization. We design an efficient algorithm for learning the unknown parameters in the convergence bound and develop a low-complexity algorithm to approximately solve the non-convex problem. Experimental results from both hardware prototype and simulation demonstrate that our proposed sampling scheme significantly reduces the convergence time compared to several baseline sampling schemes. Notably, on the hardware prototype, our scheme spends 73% less time than the uniform sampling baseline to reach the same target loss.

* Accepted in IEEE INFOCOM 2022
