Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Apr 21, 2022

Yu Wang, Shuo Ye, Shujian Yu, Xinge You

Figure 1 for R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Figure 2 for R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Figure 3 for R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Figure 4 for R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Share this with someone who'll enjoy it:

Abstract:Fine-grained visual categorization (FGVC) aims to discriminate similar subcategories, whose main challenge is the large intraclass diversities and subtle inter-class differences. Existing FGVC methods usually select discriminant regions found by a trained model, which is prone to neglect other potential discriminant information. On the other hand, the massive interactions between the sequence of image patches in ViT make the resulting class-token contain lots of redundant information, which may also impacts FGVC performance. In this paper, we present a novel approach for FGVC, which can simultaneously make use of partial yet sufficient discriminative information in environmental cues and also compress the redundant information in class-token with respect to the target. Specifically, our model calculates the ratio of high-weight regions in a batch, adaptively adjusts the masking threshold and achieves moderate extraction of background information in the input space. Moreover, we also use the Information Bottleneck~(IB) approach to guide our network to learn a minimum sufficient representations in the feature space. Experimental results on three widely-used benchmark datasets verify that our approach can achieve outperforming performance than other state-of-the-art approaches and baseline models.

View paper on

Share this with someone who'll enjoy it:

Title:R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Paper and Code