Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Timothy Castiglia

PBM-VFL: Vertical Federated Learning with Feature and Sample Privacy

Jan 27, 2025

Linh Tran, Timothy Castiglia, Stacy Patterson, Ana Milanova

Abstract:We present Poisson Binomial Mechanism Vertical Federated Learning (PBM-VFL), a communication-efficient Vertical Federated Learning algorithm with Differential Privacy guarantees. PBM-VFL combines Secure Multi-Party Computation with the recently introduced Poisson Binomial Mechanism to protect parties' private datasets during model training. We define the novel concept of feature privacy and analyze end-to-end feature and sample privacy of our algorithm. We compare sample privacy loss in VFL with privacy loss in HFL. We also provide the first theoretical characterization of the relationship between privacy budget, convergence error, and communication cost in differentially-private VFL. Finally, we empirically show that our model performs well with high levels of privacy.

Via

Access Paper or Ask Questions

LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

May 03, 2023

Timothy Castiglia, Yi Zhou, Shiqiang Wang, Swanand Kadhe, Nathalie Baracaldo, Stacy Patterson

Figure 1 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Figure 2 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Figure 3 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Figure 4 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Abstract:We propose LESS-VFL, a communication-efficient feature selection method for distributed systems with vertically partitioned data. We consider a system of a server and several parties with local datasets that share a sample ID space but have different feature sets. The parties wish to collaboratively train a model for a prediction task. As part of the training, the parties wish to remove unimportant features in the system to improve generalization, efficiency, and explainability. In LESS-VFL, after a short pre-training period, the server optimizes its part of the global model to determine the relevant outputs from party models. This information is shared with the parties to then allow local feature selection without communication. We analytically prove that LESS-VFL removes spurious features from model training. We provide extensive empirical evidence that LESS-VFL can achieve high accuracy and remove spurious features at a fraction of the communication cost of other feature selection approaches.

* Published in ICML 2023

Via

Access Paper or Ask Questions

Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Jun 16, 2022

Timothy Castiglia, Anirban Das, Shiqiang Wang, Stacy Patterson

Figure 1 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Figure 2 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Figure 3 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Figure 4 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Abstract:We propose Compressed Vertical Federated Learning (C-VFL) for communication-efficient training on vertically partitioned data. In C-VFL, a server and multiple parties collaboratively train a model on their respective features utilizing several local iterations and sharing compressed intermediate results periodically. Our work provides the first theoretical analysis of the effect message compression has on distributed training over vertically partitioned data. We prove convergence of non-convex objectives at a rate of $O(\frac{1}{\sqrt{T}})$ when the compression error is bounded over the course of training. We provide specific requirements for convergence with common compression techniques, such as quantization and top-$k$ sparsification. Finally, we experimentally show compression can reduce communication by over $90\%$ without a significant decrease in accuracy over VFL without compression.

Via

Access Paper or Ask Questions

Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Jul 27, 2020

Timothy Castiglia, Anirban Das, Stacy Patterson

Figure 1 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Figure 2 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Figure 3 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Figure 4 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Abstract:We propose Multi-Level Local SGD, a distributed gradient method for learning a smooth, non-convex objective in a heterogeneous multi-level network. Our network model consists of a set of disjoint sub-networks, with a single hub and multiple worker nodes; further, worker nodes may have different operating rates. The hubs exchange information with one another via a connected, but not necessarily complete communication network. In our algorithm, sub-networks execute a distributed SGD algorithm, using a hub-and-spoke paradigm, and the hubs periodically average their models with neighboring hubs. We first provide a unified mathematical framework that describes the Multi-Level Local SGD algorithm. We then present a theoretical analysis of the algorithm; our analysis shows the dependence of the convergence error on the worker node heterogeneity, hub network topology, and the number of local, sub-network, and global iterations. We back up our theoretical results via simulation-based experiments using both convex and non-convex objectives.

* 34 pages, 5 figures

Via

Access Paper or Ask Questions