Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Danni Peng

Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning

Jan 03, 2025

Danni Peng, Yuan Wang, Huazhu Fu, Jinpeng Jiang, Yong Liu, Rick Siow Mong Goh, Qingsong Wei

Figure 1 for Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning

Figure 2 for Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning

Figure 3 for Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning

Figure 4 for Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning

Abstract:Personalized federated learning (PFL) studies effective model personalization to address the data heterogeneity issue among clients in traditional federated learning (FL). Existing PFL approaches mainly generate personalized models by relying solely on the clients' latest updated models while ignoring their previous updates, which may result in suboptimal personalized model learning. To bridge this gap, we propose a novel framework termed pFedSeq, designed for personalizing adapters to fine-tune a foundation model in FL. In pFedSeq, the server maintains and trains a sequential learner, which processes a sequence of past adapter updates from clients and generates calibrations for personalized adapters. To effectively capture the cross-client and cross-step relations hidden in previous updates and generate high-performing personalized adapters, pFedSeq adopts the powerful selective state space model (SSM) as the architecture of sequential learner. Through extensive experiments on four public benchmark datasets, we demonstrate the superiority of pFedSeq over state-of-the-art PFL methods.

* Accepted by AAAI 2025

Via

Access Paper or Ask Questions

Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

Sep 29, 2022

Danni Peng, Sinno Jialin Pan

Figure 1 for Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

Figure 2 for Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

Figure 3 for Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

Figure 4 for Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

Abstract:To address the distribution shifts between training and test data, domain generalization (DG) leverages multiple source domains to learn a model that generalizes well to unseen domains. However, existing DG methods generally suffer from overfitting to the source domains, partly due to the limited coverage of the expected region in feature space. Motivated by this, we propose to perform mixup with data interpolation and extrapolation to cover the potential unseen regions. To prevent the detrimental effects of unconstrained extrapolation, we carefully design a policy to generate the instance weights, named Flatness-aware Gradient-based Mixup (FGMix). The policy employs a gradient-based similarity to assign greater weights to instances that carry more invariant information, and learns the similarity function towards flatter minima for better generalization. On the DomainBed benchmark, we validate the efficacy of various designs of FGMix and demonstrate its superiority over other DG algorithms.

* 22 pages, 14 figures

Via

Access Paper or Ask Questions

Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Nov 08, 2021

Danni Peng, Sinno Jialin Pan, Jie Zhang, Anxiang Zeng

Figure 1 for Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Figure 2 for Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Figure 3 for Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Figure 4 for Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Abstract:Recommender Systems (RSs) in real-world applications often deal with billions of user interactions daily. To capture the most recent trends effectively, it is common to update the model incrementally using only the newly arrived data. However, this may impede the model's ability to retain long-term information due to the potential overfitting and forgetting issues. To address this problem, we propose a novel Adaptive Sequential Model Generation (ASMG) framework, which generates a better serving model from a sequence of historical models via a meta generator. For the design of the meta generator, we propose to employ Gated Recurrent Units (GRUs) to leverage its ability to capture the long-term dependencies. We further introduce some novel strategies to apply together with the GRU meta generator, which not only improve its computational efficiency but also enable more accurate sequential modeling. By instantiating the model-agnostic framework on a general deep learning-based RS model, we demonstrate that our method achieves state-of-the-art performance on three public datasets and one industrial dataset.

* 11 pages, 6 figures, accepted by RecSys 2021

Via

Access Paper or Ask Questions