Abstract: This paper considers the problem of image set-based face verification and identification. Unlike the traditional single-sample (an image or a video) setting, this scenario assumes the availability of a heterogeneous collection of orderless images and videos. The samples can be taken at different checkpoints, from different identity documents, etc. The importance of each image is usually considered either equal or determined by a quality assessment of that image independent of the other images and videos in the set. How to model the relationship among orderless images within a set remains a challenge. We address this problem by formulating it as a Markov Decision Process (MDP) in a latent space. Specifically, we first propose a dependency-aware attention control (DAC) network, which uses actor-critic reinforcement learning to make attention decisions for each image and thereby exploit the correlations among the unordered images. An off-policy experience replay is introduced to speed up the learning process. Moreover, the DAC is combined with a temporal model for videos using a divide-and-conquer strategy. We also introduce a pose-guided representation (PGR) scheme that further boosts performance at extreme poses. We propose a parameter-free PGR that requires no training, as well as a novel metric learning-based PGR for pose alignment that requires no pose detection at the testing stage. Extensive evaluations on the IJB-A/B/C, YTF, and Celebrity-1000 datasets demonstrate that our method outperforms many state-of-the-art approaches on both set-based and video-based face recognition benchmarks.
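To make the set-fusion idea described above concrete, the following is a minimal, hypothetical sketch (not the authors' implementation): an attention controller that scores each face embedding in an unordered set conditioned on the running aggregate and returns an attention-weighted set representation. The module name, dimensions, and architecture are illustrative assumptions; the paper additionally trains such attention decisions with actor-critic reinforcement learning and off-policy experience replay, which is omitted here.

```python
# Hypothetical sketch of set-level attention fusion, assuming pre-extracted
# face embeddings of fixed dimension. Not the authors' DAC network.
import torch
import torch.nn as nn

class SetAttentionController(nn.Module):
    """Assigns a weight to each embedding in an unordered set, conditioned on
    the current aggregate, then returns the attention-weighted set feature."""

    def __init__(self, dim: int = 256, hidden: int = 128):
        super().__init__()
        # Scoring head: maps (embedding, current aggregate) -> unnormalized score.
        self.actor = nn.Sequential(
            nn.Linear(2 * dim, hidden), nn.ReLU(), nn.Linear(hidden, 1)
        )

    def forward(self, embeddings: torch.Tensor) -> torch.Tensor:
        # embeddings: (num_images, dim) -- one row per image/frame in the set.
        aggregate = embeddings.mean(dim=0, keepdim=True)  # initial context
        scores = []
        for emb in embeddings:  # orderless set; any visiting order
            state = torch.cat([emb.unsqueeze(0), aggregate], dim=-1)
            scores.append(self.actor(state))
        weights = torch.softmax(torch.cat(scores, dim=0), dim=0)  # (num_images, 1)
        return (weights * embeddings).sum(dim=0)  # fused set-level feature

# Usage: fuse a set of 5 embeddings of dimension 256.
feats = torch.randn(5, 256)
fused = SetAttentionController(dim=256)(feats)
print(fused.shape)  # torch.Size([256])
```

In this sketch the attention weights depend on the other members of the set through the aggregate context, which is the dependency-aware behavior the abstract contrasts with per-image quality scores computed independently.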