Shikai Chen

Graph Attention Transformer Network for Multi-Label Image Classification

Mar 08, 2022

Jin Yuan, Shikai Chen, Yao Zhang, Zhongchao Shi, Xin Geng, Jianping Fan, Yong Rui

Abstract: Multi-label classification aims to recognize multiple objects or attributes in an image. However, it is challenging to learn a proper label graph that effectively characterizes inter-label correlations or dependencies. Current methods often use the co-occurrence probability of labels in the training set as the adjacency matrix to model this correlation, which ties the graph to the dataset and limits the model's generalization ability. In this paper, we propose the Graph Attention Transformer Network (GATN), a general framework for multi-label image classification that can effectively mine complex inter-label relationships. First, we use the cosine similarity between label word embeddings as the initial correlation matrix, which can represent rich semantic information. Subsequently, we design a graph attention transformer layer to transfer this adjacency matrix to the current domain. Extensive experiments demonstrate that the proposed method achieves state-of-the-art performance on three datasets.
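As a rough illustration of the first step described in the abstract, the sketch below builds an initial label correlation matrix from pairwise cosine similarity of label embeddings. It is a minimal sketch, not the authors' implementation: the function name `cosine_similarity_matrix` is hypothetical, and the three-dimensional vectors are made-up toy values standing in for real word embeddings.

```python
import math


def cosine_similarity_matrix(embeddings):
    """Return the pairwise cosine-similarity matrix for a list of vectors."""
    norms = [math.sqrt(sum(x * x for x in v)) for v in embeddings]
    n = len(embeddings)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            dot = sum(a * b for a, b in zip(embeddings[i], embeddings[j]))
            denom = norms[i] * norms[j]
            # Guard against zero vectors, which have no defined direction.
            matrix[i][j] = dot / denom if denom > 0 else 0.0
    return matrix


# Toy "embeddings" (assumed values for illustration; not real word vectors).
label_embeddings = {
    "dog": [1.0, 0.0, 0.0],
    "cat": [1.0, 1.0, 0.0],
    "car": [0.0, 0.0, 1.0],
}
labels = list(label_embeddings)
adjacency = cosine_similarity_matrix([label_embeddings[name] for name in labels])

for name, row in zip(labels, adjacency):
    print(name, [round(value, 3) for value in row])
```

Unlike a co-occurrence matrix, this matrix depends only on the label names and the embedding model, not on how often labels appear together in a particular training set. The abstract states that a graph attention transformer layer then adapts it to the target domain; that step is not sketched here.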

Facial Motion Prior Networks for Facial Expression Recognition

Feb 23, 2019

Yuedong Chen, Jianfeng Wang, Shikai Chen, Zhongchao Shi, Jianfei Cai

Abstract: Deep learning based facial expression recognition (FER) has received a lot of attention in the past few years. Most existing deep learning based FER methods do not make good use of domain knowledge and therefore fail to extract representative features. In this work, we propose a novel FER framework named Facial Motion Prior Networks (FMPN). In particular, we introduce an additional branch that generates a facial mask so as to focus on regions where facial muscles move. To guide the facial mask learning, we incorporate prior domain knowledge by using the average differences between neutral faces and the corresponding expressive faces as guidance. Extensive experiments on four facial expression benchmark datasets demonstrate the effectiveness of the proposed method compared with state-of-the-art approaches.
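The guidance signal mentioned in the abstract, an average difference between neutral and expressive faces, can be pictured with the small sketch below. It is an illustration under stated assumptions rather than the paper's procedure: the function name `mean_abs_difference` is hypothetical, the use of absolute differences and the scaling to [0, 1] are choices made here, and the 2x2 grayscale "images" are toy values.

```python
def mean_abs_difference(neutral_faces, expressive_faces):
    """Average per-pixel absolute difference over paired images, scaled to [0, 1].

    Each image is a list of rows of grayscale intensities. Taking absolute
    values and dividing by the peak are assumptions made for this sketch.
    """
    if not neutral_faces or len(neutral_faces) != len(expressive_faces):
        raise ValueError("need the same non-zero number of neutral and expressive images")
    height = len(neutral_faces[0])
    width = len(neutral_faces[0][0])
    total = [[0.0] * width for _ in range(height)]
    for neutral, expressive in zip(neutral_faces, expressive_faces):
        for r in range(height):
            for c in range(width):
                total[r][c] += abs(expressive[r][c] - neutral[r][c])
    count = len(neutral_faces)
    mean = [[value / count for value in row] for row in total]
    peak = max(max(row) for row in mean)
    # Scale so the most strongly moving pixel has weight 1.
    return [[value / peak if peak > 0 else 0.0 for value in row] for row in mean]


# Two toy image pairs for one expression (assumed values for illustration).
neutral = [
    [[0.0, 0.0], [0.0, 0.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]
expressive = [
    [[0.0, 0.0], [0.5, 1.0]],
    [[0.0, 0.2], [0.3, 1.0]],
]
guidance_mask = mean_abs_difference(neutral, expressive)
print(guidance_mask)
```

Pixels that change most between the neutral and expressive images receive the largest weights, which matches the abstract's stated goal of focusing on regions where facial muscles move.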
