Title: A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis

Nov 07, 2023

Dipanjyoti Paul, Arpita Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel Stevens, Kaiya Provost, Anuj Karpatne, Bryan Carstens, Daniel Rubenstein (+4 more)

Abstract: We present a novel usage of Transformers to make image classification interpretable. Unlike mainstream classifiers that wait until the last fully-connected layer to incorporate class information to make predictions, we investigate a proactive approach, asking each class to search for itself in an image. We realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR). We learn "class-specific" queries (one for each class) as input to the decoder, enabling each class to localize its patterns in an image via cross-attention. We name our approach INterpretable TRansformer (INTR), which is fairly easy to implement and exhibits several compelling properties. We show that INTR intrinsically encourages each class to attend distinctively; the cross-attention weights thus provide a faithful interpretation of the prediction. Interestingly, via "multi-head" cross-attention, INTR could identify different "attributes" of a class, making it particularly suitable for fine-grained classification and analysis, which we demonstrate on eight datasets. Our code and pre-trained model are publicly accessible at https://github.com/Imageomics/INTR.
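
The idea of each class "searching for itself" via cross-attention can be illustrated with a small, dependency-free sketch. This is not the authors' implementation (see the repository linked above for that); it is a toy under stated assumptions: a single attention head, no learned key/value projections, random stand-in values for the patch features and the class queries, and a hypothetical shared `score_vector` that turns each class's attended feature into a logit. All function and variable names here are illustrative.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross_attend(query, patches):
    """Single-head cross-attention of one class query over patch features.

    Returns (attention weights over patches, attended feature vector).
    Simplification: keys and values are the raw patch features.
    """
    d = len(query)
    weights = softmax([dot(query, p) / math.sqrt(d) for p in patches])
    attended = [sum(w * p[i] for w, p in zip(weights, patches)) for i in range(d)]
    return weights, attended


def classify(class_queries, patches, score_vector):
    """Let every class query attend to the image, then score each result.

    score_vector is an assumed shared scorer, used here only for illustration.
    """
    attn_maps, logits = [], []
    for q in class_queries:
        weights, attended = cross_attend(q, patches)
        attn_maps.append(weights)
        logits.append(dot(score_vector, attended))
    predicted = max(range(len(logits)), key=logits.__getitem__)
    return predicted, logits, attn_maps


# Toy data: random stand-ins for encoder outputs and learned parameters.
rng = random.Random(0)
num_classes, num_patches, dim = 3, 6, 4
class_queries = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_classes)]
patches = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_patches)]
score_vector = [rng.gauss(0, 1) for _ in range(dim)]

predicted, logits, attn_maps = classify(class_queries, patches, score_vector)
print("predicted class:", predicted)
print("attention of predicted class over patches:",
      [round(w, 3) for w in attn_maps[predicted]])
```

In this sketch, the attention weights of the predicted class play the role the abstract assigns to cross-attention weights: they indicate which image patches that class's query relied on. A multi-head variant would yield one such map per head, which is where the abstract's per-class "attributes" would come from.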
