Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stacy Patterson

PBM-VFL: Vertical Federated Learning with Feature and Sample Privacy

Jan 27, 2025

Linh Tran, Timothy Castiglia, Stacy Patterson, Ana Milanova

Abstract:We present Poisson Binomial Mechanism Vertical Federated Learning (PBM-VFL), a communication-efficient Vertical Federated Learning algorithm with Differential Privacy guarantees. PBM-VFL combines Secure Multi-Party Computation with the recently introduced Poisson Binomial Mechanism to protect parties' private datasets during model training. We define the novel concept of feature privacy and analyze end-to-end feature and sample privacy of our algorithm. We compare sample privacy loss in VFL with privacy loss in HFL. We also provide the first theoretical characterization of the relationship between privacy budget, convergence error, and communication cost in differentially-private VFL. Finally, we empirically show that our model performs well with high levels of privacy.

Via

Access Paper or Ask Questions

Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models

Jan 23, 2025

Linh Tran, Wei Sun, Stacy Patterson, Ana Milanova

Abstract:Multimodal Large Language Models (LLMs) are pivotal in revolutionizing customer support and operations by integrating multiple modalities such as text, images, and audio. Federated Prompt Learning (FPL) is a recently proposed approach that combines pre-trained multimodal LLMs such as vision-language models with federated learning to create personalized, privacy-preserving AI systems. However, balancing the competing goals of personalization, generalization, and privacy remains a significant challenge. Over-personalization can lead to overfitting, reducing generalizability, while stringent privacy measures, such as differential privacy, can hinder both personalization and generalization. In this paper, we propose a Differentially Private Federated Prompt Learning (DP-FPL) approach to tackle this challenge by leveraging a low-rank adaptation scheme to capture generalization while maintaining a residual term that preserves expressiveness for personalization. To ensure privacy, we introduce a novel method where we apply local differential privacy to the two low-rank components of the local prompt, and global differential privacy to the global prompt. Our approach mitigates the impact of privacy noise on the model performance while balancing the tradeoff between personalization and generalization. Extensive experiments demonstrate the effectiveness of our approach over other benchmarks.

* Accepted to ICLR 2025 main conference track

Via

Access Paper or Ask Questions

A Differentially Private Blockchain-Based Approach for Vertical Federated Learning

Jul 09, 2024

Linh Tran, Sanjay Chari, Md. Saikat Islam Khan, Aaron Zachariah, Stacy Patterson, Oshani Seneviratne

Abstract:We present the Differentially Private Blockchain-Based Vertical Federal Learning (DP-BBVFL) algorithm that provides verifiability and privacy guarantees for decentralized applications. DP-BBVFL uses a smart contract to aggregate the feature representations, i.e., the embeddings, from clients transparently. We apply local differential privacy to provide privacy for embeddings stored on a blockchain, hence protecting the original data. We provide the first prototype application of differential privacy with blockchain for vertical federated learning. Our experiments with medical data show that DP-BBVFL achieves high accuracy with a tradeoff in training time due to on-chain aggregation. This innovative fusion of differential privacy and blockchain technology in DP-BBVFL could herald a new era of collaborative and trustworthy machine learning applications across several decentralized application domains.

Via

Access Paper or Ask Questions

A Survey of Privacy Threats and Defense in Vertical Federated Learning: From Model Life Cycle Perspective

Feb 06, 2024

Lei Yu, Meng Han, Yiming Li, Changting Lin, Yao Zhang, Mingyang Zhang, Yan Liu, Haiqin Weng, Yuseok Jeon, Ka-Ho Chow(+1 more)

Abstract:Vertical Federated Learning (VFL) is a federated learning paradigm where multiple participants, who share the same set of samples but hold different features, jointly train machine learning models. Although VFL enables collaborative machine learning without sharing raw data, it is still susceptible to various privacy threats. In this paper, we conduct the first comprehensive survey of the state-of-the-art in privacy attacks and defenses in VFL. We provide taxonomies for both attacks and defenses, based on their characterizations, and discuss open challenges and future research directions. Specifically, our discussion is structured around the model's life cycle, by delving into the privacy threats encountered during different stages of machine learning and their corresponding countermeasures. This survey not only serves as a resource for the research community but also offers clear guidance and actionable insights for practitioners to safeguard data privacy throughout the model's life cycle.

Via

Access Paper or Ask Questions

LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

May 03, 2023

Timothy Castiglia, Yi Zhou, Shiqiang Wang, Swanand Kadhe, Nathalie Baracaldo, Stacy Patterson

Figure 1 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Figure 2 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Figure 3 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Figure 4 for LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning

Abstract:We propose LESS-VFL, a communication-efficient feature selection method for distributed systems with vertically partitioned data. We consider a system of a server and several parties with local datasets that share a sample ID space but have different feature sets. The parties wish to collaboratively train a model for a prediction task. As part of the training, the parties wish to remove unimportant features in the system to improve generalization, efficiency, and explainability. In LESS-VFL, after a short pre-training period, the server optimizes its part of the global model to determine the relevant outputs from party models. This information is shared with the parties to then allow local feature selection without communication. We analytically prove that LESS-VFL removes spurious features from model training. We provide extensive empirical evidence that LESS-VFL can achieve high accuracy and remove spurious features at a fraction of the communication cost of other feature selection approaches.

* Published in ICML 2023

Via

Access Paper or Ask Questions

Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Jun 16, 2022

Timothy Castiglia, Anirban Das, Shiqiang Wang, Stacy Patterson

Figure 1 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Figure 2 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Figure 3 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Figure 4 for Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data

Abstract:We propose Compressed Vertical Federated Learning (C-VFL) for communication-efficient training on vertically partitioned data. In C-VFL, a server and multiple parties collaboratively train a model on their respective features utilizing several local iterations and sharing compressed intermediate results periodically. Our work provides the first theoretical analysis of the effect message compression has on distributed training over vertically partitioned data. We prove convergence of non-convex objectives at a rate of $O(\frac{1}{\sqrt{T}})$ when the compression error is bounded over the course of training. We provide specific requirements for convergence with common compression techniques, such as quantization and top-$k$ sparsification. Finally, we experimentally show compression can reduce communication by over $90\%$ without a significant decrease in accuracy over VFL without compression.

Via

Access Paper or Ask Questions

Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

Aug 19, 2021

Anirban Das, Shiqiang Wang, Stacy Patterson

Figure 1 for Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

Figure 2 for Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

Figure 3 for Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

Figure 4 for Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

Abstract:We consider federated learning in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical data shard partitioned horizontally across its clients. We propose Tiered Decentralized Coordinate Descent (TDCD), a communication-efficient decentralized training algorithm for such two-tiered networks. To reduce communication overhead, the clients in each silo perform multiple local gradient steps before sharing updates with their hub. Each hub adjusts its coordinates by averaging its workers' updates, and then hubs exchange intermediate updates with one another. We present a theoretical analysis of our algorithm and show the dependence of the convergence rate on the number of vertical partitions, the number of local updates, and the number of clients in each hub. We further validate our approach empirically via simulation-based experiments using a variety of datasets and objectives.

* 25 Pages, 11 Figures, Under Review. arXiv admin note: text overlap with arXiv:2102.03620

Via

Access Paper or Ask Questions

Multi-Tier Federated Learning for Vertically Partitioned Data

Feb 06, 2021

Anirban Das, Stacy Patterson

Figure 1 for Multi-Tier Federated Learning for Vertically Partitioned Data

Figure 2 for Multi-Tier Federated Learning for Vertically Partitioned Data

Figure 3 for Multi-Tier Federated Learning for Vertically Partitioned Data

Abstract:We consider decentralized model training in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical data shard partitioned horizontally across its clients. We propose Tiered Decentralized Coordinate Descent (TDCD), a communication-efficient decentralized training algorithm for such two-tiered networks. To reduce communication overhead, the clients in each silo perform multiple local gradient steps before sharing updates with their hub. Each hub adjusts its coordinates by averaging its workers' updates, and then hubs exchange intermediate updates with one another. We present a theoretical analysis of our algorithm and show the dependence of the convergence rate on the number of vertical partitions, the number of local updates, and the number of clients in each hub. We further validate our approach empirically via simulation-based experiments using a variety of datasets and both convex and non-convex objectives.

* 11 pages, 3 figures, To be published in ICASSP 2021

Via

Access Paper or Ask Questions

Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Jul 27, 2020

Timothy Castiglia, Anirban Das, Stacy Patterson

Figure 1 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Figure 2 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Figure 3 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Figure 4 for Multi-Level Local SGD for Heterogeneous Hierarchical Networks

Abstract:We propose Multi-Level Local SGD, a distributed gradient method for learning a smooth, non-convex objective in a heterogeneous multi-level network. Our network model consists of a set of disjoint sub-networks, with a single hub and multiple worker nodes; further, worker nodes may have different operating rates. The hubs exchange information with one another via a connected, but not necessarily complete communication network. In our algorithm, sub-networks execute a distributed SGD algorithm, using a hub-and-spoke paradigm, and the hubs periodically average their models with neighboring hubs. We first provide a unified mathematical framework that describes the Multi-Level Local SGD algorithm. We then present a theoretical analysis of the algorithm; our analysis shows the dependence of the convergence error on the worker node heterogeneity, hub network topology, and the number of local, sub-network, and global iterations. We back up our theoretical results via simulation-based experiments using both convex and non-convex objectives.

* 34 pages, 5 figures

Via

Access Paper or Ask Questions

BubbleTouch: A Quasi-Static Tactile Skin Simulator

Sep 24, 2018

Brayden Hollis, Stacy Patterson, Jinda Cui, Jeff Trinkle

Figure 1 for BubbleTouch: A Quasi-Static Tactile Skin Simulator

Figure 2 for BubbleTouch: A Quasi-Static Tactile Skin Simulator

Figure 3 for BubbleTouch: A Quasi-Static Tactile Skin Simulator

Abstract:We present BubbleTouch, an open source quasi-static simulator for robotic tactile skins. BubbleTouch can be used to simulate contact with a robot's tactile skin patches as it interacts with humans and objects. The simulator creates detailed traces of contact forces that can be used in experiments in tactile contact activities. We summarize the design of BubbleTouch and highlight our recent work that uses BubbleTouch for experiments with tactile object recognition.

* Presented at AI-HRI AAAI-FSS, 2018 (arXiv:1809.06606)

Via

Access Paper or Ask Questions