Title: Transformer-based Dual Relation Graph for Multi-label Image Recognition

Oct 12, 2021

Jiawei Zhao, Ke Yan, Yifan Zhao, Xiaowei Guo, Feiyue Huang, Jia Li

Abstract: The simultaneous recognition of multiple objects in one image remains a challenging task because of several difficulties in the recognition field, such as varying object scales, inconsistent appearances, and confusing inter-class relationships. Recent research efforts mainly resort to statistical label co-occurrences and linguistic word embeddings to enhance the unclear semantics. Unlike these approaches, in this paper we propose a novel Transformer-based Dual Relation learning framework, which constructs complementary relationships by exploring two aspects of correlation: a structural relation graph and a semantic relation graph. The structural relation graph aims to capture long-range correlations from object context through a cross-scale transformer-based architecture. The semantic relation graph dynamically models the semantic meanings of image objects with explicit semantic-aware constraints. In addition, we incorporate the learnt structural relationship into the semantic graph, constructing a joint relation graph for robust representations. With the collaborative learning of these two relation graphs, our approach achieves new state-of-the-art results on two popular multi-label recognition benchmarks, MS-COCO and VOC 2007.
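To make the idea of a dynamically built relation graph over image categories more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names (`dynamic_adjacency`, `propagate`, `multilabel_scores`), the toy feature vectors, and the choice of softmax-normalised dot-product similarity are all assumptions made for illustration. It shows one generic message-passing step over per-category features, followed by an independent sigmoid score per category, which is what distinguishes multi-label recognition from single-label softmax classification.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_adjacency(node_feats):
    """Build a row-normalised adjacency from pairwise dot-product similarity.

    The graph is recomputed from the per-image category features, so it
    changes with the input (assumed similarity measure, for illustration).
    """
    sims = [
        [sum(a * b for a, b in zip(u, v)) for v in node_feats]
        for u in node_feats
    ]
    return [softmax(row) for row in sims]


def propagate(adj, node_feats):
    """One message-passing step: each node becomes a weighted mix of all nodes."""
    dim = len(node_feats[0])
    return [
        [sum(w * feat[d] for w, feat in zip(row, node_feats)) for d in range(dim)]
        for row in adj
    ]


def multilabel_scores(node_feats, weights):
    """Independent sigmoid score per category (multi-label, not softmax)."""
    scores = []
    for feat, wt in zip(node_feats, weights):
        logit = sum(f * w for f, w in zip(feat, wt))
        scores.append(1.0 / (1.0 + math.exp(-logit)))
    return scores


# Toy example: three hypothetical categories with 2-D features.
feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
adj = dynamic_adjacency(feats)
refined = propagate(adj, feats)
scores = multilabel_scores(refined, [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
```

In this toy setup the first two categories have similar features, so they receive a larger mutual edge weight than either has with the third. Each category's score is computed independently, so several categories can be predicted as present in the same image.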

* In Proceedings of the IEEE/CVF International Conference on Computer Vision 2021 (pp. 163-172)
* 10 pages, 5 figures. Published in ICCV 2021
