Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nicolò Michelusi

Biased Federated Learning under Wireless Heterogeneity

Mar 08, 2025

Muhammad Faraz Ul Abrar, Nicolò Michelusi

Abstract:Federated learning (FL) has emerged as a promising framework for distributed learning, enabling collaborative model training without sharing private data. Existing wireless FL works primarily adopt two communication strategies: (1) over-the-air (OTA) computation, which exploits wireless signal superposition for simultaneous gradient aggregation, and (2) digital communication, which allocates orthogonal resources for gradient uploads. Prior works on both schemes typically assume \emph{homogeneous} wireless conditions (equal path loss across devices) to enforce zero-bias updates or permit uncontrolled bias, resulting in suboptimal performance and high-variance model updates in \emph{heterogeneous} environments, where devices with poor channel conditions slow down convergence. This paper addresses FL over heterogeneous wireless networks by proposing novel OTA and digital FL updates that allow a structured, time-invariant model bias, thereby reducing variance in FL updates. We analyze their convergence under a unified framework and derive an upper bound on the model ``optimality error", which explicitly quantifies the effect of bias and variance in terms of design parameters. Next, to optimize this trade-off, we study a non-convex optimization problem and develop a successive convex approximation (SCA)-based framework to jointly optimize the design parameters. We perform extensive numerical evaluations with several related design variants and state-of-the-art OTA and digital FL schemes. Our results confirm that minimizing the bias-variance trade-off while allowing a structured bias provides better FL convergence performance than existing schemes.

* Submitted to IEEE Transactions on Wireless Communications

Via

Access Paper or Ask Questions

NCAirFL: CSI-Free Over-the-Air Federated Learning Based on Non-Coherent Detection

Nov 20, 2024

Haifeng Wen, Nicolò Michelusi, Osvaldo Simeone, Hong Xing

Figure 1 for NCAirFL: CSI-Free Over-the-Air Federated Learning Based on Non-Coherent Detection

Figure 2 for NCAirFL: CSI-Free Over-the-Air Federated Learning Based on Non-Coherent Detection

Figure 3 for NCAirFL: CSI-Free Over-the-Air Federated Learning Based on Non-Coherent Detection

Abstract:Over-the-air federated learning (FL), i.e., AirFL, leverages computing primitively over multiple access channels. A long-standing challenge in AirFL is to achieve coherent signal alignment without relying on expensive channel estimation and feedback. This paper proposes NCAirFL, a CSI-free AirFL scheme based on unbiased non-coherent detection at the edge server. By exploiting binary dithering and a long-term memory based error-compensation mechanism, NCAirFL achieves a convergence rate of order $\mathcal{O}(1/\sqrt{T})$ in terms of the average square norm of the gradient for general non-convex and smooth objectives, where $T$ is the number of communication rounds. Experiments demonstrate the competitive performance of NCAirFL compared to vanilla FL with ideal communications and to coherent transmission-based benchmarks.

* 6 pages, 2 figures, submitted for possible publication

Via

Access Paper or Ask Questions

Biased Over-the-Air Federated Learning under Wireless Heterogeneity

Mar 28, 2024

Muhammad Faraz Ul Abrar, Nicolò Michelusi

Abstract:Recently, Over-the-Air (OTA) computation has emerged as a promising federated learning (FL) paradigm that leverages the waveform superposition properties of the wireless channel to realize fast model updates. Prior work focused on the OTA device ``pre-scaler" design under \emph{homogeneous} wireless conditions, in which devices experience the same average path loss, resulting in zero-bias solutions. Yet, zero-bias designs are limited by the device with the worst average path loss and hence may perform poorly in \emph{heterogeneous} wireless settings. In this scenario, there may be a benefit in designing \emph{biased} solutions, in exchange for a lower variance in the model updates. To optimize this trade-off, we study the design of OTA device pre-scalers by focusing on the OTA-FL convergence. We derive an upper bound on the model ``optimality error", which explicitly captures the effect of bias and variance in terms of the choice of the pre-scalers. Based on this bound, we identify two solutions of interest: minimum noise variance, and minimum noise variance zero-bias solutions. Numerical evaluations show that using OTA device pre-scalers that minimize the variance of FL updates, while allowing a small bias, can provide high gains over existing schemes.

* Accepted at IEEE International Conference on Communications (ICC), 2024

Via

Access Paper or Ask Questions

Analog-digital Scheduling for Federated Learning: A Communication-Efficient Approach

Feb 02, 2024

Muhammad Faraz Ul Abrar, Nicolò Michelusi

Abstract:Over-the-air (OTA) computation has recently emerged as a communication-efficient Federated Learning (FL) paradigm to train machine learning models over wireless networks. However, its performance is limited by the device with the worst SNR, resulting in fast yet noisy updates. On the other hand, allocating orthogonal resource blocks (RB) to individual devices via digital channels mitigates the noise problem, at the cost of increased communication latency. In this paper, we address this discrepancy and present ADFL, a novel Analog-Digital FL scheme: in each round, the parameter server (PS) schedules each device to either upload its gradient via the analog OTA scheme or transmit its quantized gradient over an orthogonal RB using the ``digital" scheme. Focusing on a single FL round, we cast the optimal scheduling problem as the minimization of the mean squared error (MSE) on the estimated global gradient at the PS, subject to a delay constraint, yielding the optimal device scheduling configuration and quantization bits for the digital devices. Our simulation results show that ADFL, by scheduling most of the devices in the OTA scheme while also occasionally employing the digital scheme for a few devices, consistently outperforms OTA-only and digital-only schemes, in both i.i.d. and non-i.i.d. settings.

* Appeared at the 2023 Asilomar Conference on Signals, Systems, and Computers

Via

Access Paper or Ask Questions

Delay-Aware Hierarchical Federated Learning

Mar 23, 2023

Frank Po-Chen Lin, Seyyedali Hosseinalipour, Christopher Brinton, Nicolò Michelusi

Figure 1 for Delay-Aware Hierarchical Federated Learning

Figure 2 for Delay-Aware Hierarchical Federated Learning

Figure 3 for Delay-Aware Hierarchical Federated Learning

Figure 4 for Delay-Aware Hierarchical Federated Learning

Abstract:Federated learning has gained popularity as a means of training models distributed across the wireless edge. The paper introduces delay-aware federated learning (DFL) to improve the efficiency of distributed machine learning (ML) model training by addressing communication delays between edge and cloud. DFL employs multiple stochastic gradient descent iterations on device datasets during each global aggregation interval and intermittently aggregates model parameters through edge servers in local subnetworks. The cloud server synchronizes the local models with the global deployed model computed via a local-global combiner at global synchronization. The convergence behavior of DFL is theoretically investigated under a generalized data heterogeneity metric. A set of conditions is obtained to achieve the sub-linear convergence rate of O(1/k). Based on these findings, an adaptive control algorithm is developed for DFL, implementing policies to mitigate energy consumption and edge-to-cloud communication latency while aiming for a sublinear convergence rate. Numerical evaluations show DFL's superior performance in terms of faster global model convergence, reduced resource consumption, and robustness against communication delays compared to existing FL algorithms. In summary, this proposed method offers improved efficiency and satisfactory results when dealing with both convex and non-convex loss functions.

* A condensed version of this paper was presented at IEEE Globecom 2020

Via

Access Paper or Ask Questions

Decentralized Federated Learning via Non-Coherent Over-the-Air Consensus

Oct 27, 2022

Nicolò Michelusi

Abstract:This paper presents NCOTA-DGD, a Decentralized Gradient Descent (DGD) algorithm that combines local gradient descent with Non-Coherent Over-The-Air (NCOTA) consensus at the receivers to solve distributed machine-learning problems over wirelessly-connected systems. NCOTA-DGD leverages the waveform superposition properties of the wireless channels: it enables simultaneous transmissions under half-duplex constraints, by mapping local signals to a mixture of preamble sequences, and consensus via non-coherent combining at the receivers. NCOTA-DGD operates without channel state information and leverages the average channel pathloss to mix signals, without explicit knowledge of the mixing weights (typically known in consensus-based optimization algorithms). It is shown both theoretically and numerically that, for smooth and strongly-convex problems with fixed consensus and learning stepsizes, the updates of NCOTA-DGD converge (in Euclidean distance) to the global optimum with rate $\mathcal O(K^{-1/4})$ for a target number of iterations $K$. NCOTA-DGD is evaluated numerically over a logistic regression problem, showing faster convergence vis-\`a-vis running time than implementations of the classical DGD algorithm over digital and analog orthogonal channels.

* Submitted to IEEE ICC 2023

Via

Access Paper or Ask Questions

Federated Learning Beyond the Star: Local D2D Model Consensus with Global Cluster Sampling

Sep 12, 2021

Frank Po-Chen Lin, Seyyedali Hosseinalipour, Sheikh Shams Azam, Christopher G. Brinton, Nicolò Michelusi

Figure 1 for Federated Learning Beyond the Star: Local D2D Model Consensus with Global Cluster Sampling

Figure 2 for Federated Learning Beyond the Star: Local D2D Model Consensus with Global Cluster Sampling

Figure 3 for Federated Learning Beyond the Star: Local D2D Model Consensus with Global Cluster Sampling

Figure 4 for Federated Learning Beyond the Star: Local D2D Model Consensus with Global Cluster Sampling

Abstract:Federated learning has emerged as a popular technique for distributing model training across the network edge. Its learning architecture is conventionally a star topology between the devices and a central server. In this paper, we propose two timescale hybrid federated learning (TT-HF), which migrates to a more distributed topology via device-to-device (D2D) communications. In TT-HF, local model training occurs at devices via successive gradient iterations, and the synchronization process occurs at two timescales: (i) macro-scale, where global aggregations are carried out via device-server interactions, and (ii) micro-scale, where local aggregations are carried out via D2D cooperative consensus formation in different device clusters. Our theoretical analysis reveals how device, cluster, and network-level parameters affect the convergence of TT-HF, and leads to a set of conditions under which a convergence rate of O(1/t) is guaranteed. Experimental results demonstrate the improvements in convergence and utilization that can be obtained by TT-HF over state-of-the-art federated learning baselines.

* This paper has been published in IEEE Global Communications Conference 2021 (Globecom). arXiv admin note: substantial text overlap with arXiv:2103.10481

Via

Access Paper or Ask Questions

Finite-Bit Quantization For Distributed Algorithms With Linear Convergence

Jul 23, 2021

Chang-Shen Lee, Nicolò Michelusi, Gesualdo Scutari

Figure 1 for Finite-Bit Quantization For Distributed Algorithms With Linear Convergence

Figure 2 for Finite-Bit Quantization For Distributed Algorithms With Linear Convergence

Figure 3 for Finite-Bit Quantization For Distributed Algorithms With Linear Convergence

Figure 4 for Finite-Bit Quantization For Distributed Algorithms With Linear Convergence

Abstract:This paper studies distributed algorithms for (strongly convex) composite optimization problems over mesh networks, subject to quantized communications. Instead of focusing on a specific algorithmic design, we propose a black-box model casting distributed algorithms in the form of fixed-point iterates, converging at linear rate. The algorithmic model is coupled with a novel (random) Biased Compression (BC-)rule on the quantizer design, which preserves linear convergence. A new quantizer coupled with a communication-efficient encoding scheme is also proposed, which efficiently implements the BC-rule using a finite number of bits. This contrasts with most of existing quantization rules, whose implementation calls for an infinite number of bits. A unified communication complexity analysis is developed for the black-box model, determining the average number of bit required to reach a solution of the optimization problem within the required accuracy. Numerical results validate our theoretical findings and show that distributed algorithms equipped with the proposed quantizer have more favorable communication complexity than algorithms using existing quantization rules.

* Submitted to the IEEE Transactions on Information Theory

Via

Access Paper or Ask Questions

Federated Learning with Communication Delay in Edge Networks

Aug 21, 2020

Frank Po-Chen Lin, Christopher G. Brinton, Nicolò Michelusi

Figure 1 for Federated Learning with Communication Delay in Edge Networks

Figure 2 for Federated Learning with Communication Delay in Edge Networks

Figure 3 for Federated Learning with Communication Delay in Edge Networks

Abstract:Federated learning has received significant attention as a potential solution for distributing machine learning (ML) model training through edge networks. This work addresses an important consideration of federated learning at the network edge: communication delays between the edge nodes and the aggregator. A technique called FedDelAvg (federated delayed averaging) is developed, which generalizes the standard federated averaging algorithm to incorporate a weighting between the current local model and the delayed global model received at each device during the synchronization step. Through theoretical analysis, an upper bound is derived on the global model loss achieved by FedDelAvg, which reveals a strong dependency of learning performance on the values of the weighting and learning rate. Experimental results on a popular ML task indicate significant improvements in terms of convergence speed when optimizing the weighting scheme to account for delays.

* Accepted for publication at IEEE Global Communications Conference (Globecom 2020)

Via

Access Paper or Ask Questions