Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Blending Anti-Aliasing into Vision Transformer

Oct 28, 2021

Shengju Qian, Hao Shao, Yi Zhu, Mu Li, Jiaya Jia

Figure 1 for Blending Anti-Aliasing into Vision Transformer

Figure 2 for Blending Anti-Aliasing into Vision Transformer

Figure 3 for Blending Anti-Aliasing into Vision Transformer

Figure 4 for Blending Anti-Aliasing into Vision Transformer

Share this with someone who'll enjoy it:

Abstract:The transformer architectures, based on self-attention mechanism and convolution-free design, recently found superior performance and booming applications in computer vision. However, the discontinuous patch-wise tokenization process implicitly introduces jagged artifacts into attention maps, arising the traditional problem of aliasing for vision transformers. Aliasing effect occurs when discrete patterns are used to produce high frequency or continuous information, resulting in the indistinguishable distortions. Recent researches have found that modern convolution networks still suffer from this phenomenon. In this work, we analyze the uncharted problem of aliasing in vision transformer and explore to incorporate anti-aliasing properties. Specifically, we propose a plug-and-play Aliasing-Reduction Module(ARM) to alleviate the aforementioned issue. We investigate the effectiveness and generalization of the proposed method across multiple tasks and various vision transformer families. This lightweight design consistently attains a clear boost over several famous structures. Furthermore, our module also improves data efficiency and robustness of vision transformers.

* Accepted to NeurIPS 2021

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Blending Anti-Aliasing into Vision Transformer

Paper and Code