Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Mask-CNN: Localizing Parts and Selecting Descriptors for Fine-Grained Image Recognition

May 23, 2016

Xiu-Shen Wei, Chen-Wei Xie, Jianxin Wu

Figure 1 for Mask-CNN: Localizing Parts and Selecting Descriptors for Fine-Grained Image Recognition

Figure 2 for Mask-CNN: Localizing Parts and Selecting Descriptors for Fine-Grained Image Recognition

Figure 3 for Mask-CNN: Localizing Parts and Selecting Descriptors for Fine-Grained Image Recognition

Figure 4 for Mask-CNN: Localizing Parts and Selecting Descriptors for Fine-Grained Image Recognition

Share this with someone who'll enjoy it:

Abstract:Fine-grained image recognition is a challenging computer vision problem, due to the small inter-class variations caused by highly similar subordinate categories, and the large intra-class variations in poses, scales and rotations. In this paper, we propose a novel end-to-end Mask-CNN model without the fully connected layers for fine-grained recognition. Based on the part annotations of fine-grained images, the proposed model consists of a fully convolutional network to both locate the discriminative parts (e.g., head and torso), and more importantly generate object/part masks for selecting useful and meaningful convolutional descriptors. After that, a four-stream Mask-CNN model is built for aggregating the selected object- and part-level descriptors simultaneously. The proposed Mask-CNN model has the smallest number of parameters, lowest feature dimensionality and highest recognition accuracy when compared with state-of-the-arts fine-grained approaches.

* Submitted to NIPS 2016

View paper on

Share this with someone who'll enjoy it:

Title:Mask-CNN: Localizing Parts and Selecting Descriptors for Fine-Grained Image Recognition

Paper and Code