Title: Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts

Feb 08, 2024

Zhili Liu, Kai Chen, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, James T. Kwok

Abstract: Masked Autoencoder (MAE) is a prevailing self-supervised learning method that achieves promising results in model pre-training. However, when the various downstream tasks have data distributions different from the pre-training data, the semantically irrelevant pre-training information might result in negative transfer, impeding MAE's scalability. To address this issue, we propose a novel MAE-based pre-training paradigm, Mixture of Cluster-conditional Experts (MoCE), which can be trained once but provides customized pre-training models for diverse downstream tasks. Different from the mixture of experts (MoE), our MoCE trains each expert only with semantically relevant images by using cluster-conditional gates. Thus, each downstream task can be allocated to its customized model pre-trained with data most similar to the downstream data. Experiments on a collection of 11 downstream tasks show that MoCE outperforms the vanilla MAE by 2.45% on average. It also obtains new state-of-the-art self-supervised learning results on detection and segmentation.
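The routing idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how clusters are formed or how gates are placed inside the network, so the nearest-centroid assignment, the `cluster_to_expert` mapping, the majority-vote rule for picking a task's expert, and all function names below are assumptions made purely for illustration.

```python
from collections import Counter
from math import dist


def nearest_cluster(feature, centroids):
    """Return the index of the centroid closest to `feature` (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda k: dist(feature, centroids[k]))


def route_to_expert(feature, centroids, cluster_to_expert):
    """Hypothetical cluster-conditional gate: the expert is chosen from the
    image's cluster, so each expert only sees semantically similar images."""
    return cluster_to_expert[nearest_cluster(feature, centroids)]


def select_expert_for_task(task_features, centroids, cluster_to_expert):
    """Assumed selection rule: pick the expert that most downstream images route to."""
    votes = Counter(
        route_to_expert(f, centroids, cluster_to_expert) for f in task_features
    )
    return votes.most_common(1)[0][0]


# Toy 2-D "image features"; real features would come from a pre-trained encoder.
centroids = [(0.0, 0.0), (5.0, 5.0), (10.0, 0.0)]
cluster_to_expert = {0: 0, 1: 1, 2: 1}  # several clusters may share one expert
task_features = [(4.8, 5.1), (5.2, 4.9), (0.1, 0.2)]

chosen = select_expert_for_task(task_features, centroids, cluster_to_expert)
print(chosen)  # prints 1: two of the three downstream images fall in cluster 1
```

In this toy setting, a downstream task whose images mostly resemble cluster 1 is served by expert 1, which mirrors the abstract's claim that each task is allocated a model pre-trained on the data most similar to its own.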

* Accepted by ICLR 2023

View paper on OpenReview
