Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Improving Masked Autoencoders by Learning Where to Mask

Mar 12, 2023

Haijian Chen, Wendong Zhang, Yunbo Wang, Xiaokang Yang

Figure 1 for Improving Masked Autoencoders by Learning Where to Mask

Figure 2 for Improving Masked Autoencoders by Learning Where to Mask

Figure 3 for Improving Masked Autoencoders by Learning Where to Mask

Figure 4 for Improving Masked Autoencoders by Learning Where to Mask

Share this with someone who'll enjoy it:

Abstract:Masked image modeling is a promising self-supervised learning method for visual data. It is typically built upon image patches with random masks, which largely ignores the variation of information density between them. The question is: Is there a better masking strategy than random sampling and how can we learn it? We empirically study this problem and initially find that introducing object-centric priors in mask sampling can significantly improve the learned representations. Inspired by this observation, we present AutoMAE, a fully differentiable framework that uses Gumbel-Softmax to interlink an adversarially-trained mask generator and a mask-guided image modeling process. In this way, our approach can adaptively find patches with higher information density for different images, and further strike a balance between the information gain obtained from image reconstruction and its practical training difficulty. In our experiments, AutoMAE is shown to provide effective pretraining models on standard self-supervised benchmarks and downstream tasks.

* 14 pages, 8 figures. Under review

View paper on

Share this with someone who'll enjoy it:

Title:Improving Masked Autoencoders by Learning Where to Mask

Paper and Code