Title: TTAN: Two-Stage Temporal Alignment Network for Few-shot Action Recognition

Jul 10, 2021

Shuyuan Li, Huabin Liu, Rui Qian, Yuxi Li, John See, Mengjuan Fei, Xiaoyuan Yu, Weiyao Lin

Abstract: Few-shot action recognition aims to recognize novel action classes (query) using just a few samples (support). The majority of current approaches follow the metric learning paradigm, which learns to compare the similarity between videos. Recently, it has been observed that directly measuring this similarity is not ideal, since different action instances may show distinctive temporal distributions, resulting in severe misalignment between query and support videos. In this paper, we tackle this problem from two distinct aspects: action duration misalignment and motion evolution misalignment. We address them sequentially through a Two-stage Temporal Alignment Network (TTAN). The first stage performs a temporal transformation with predicted affine warp parameters, while the second stage uses a cross-attention mechanism to coordinate the features of the support and query videos into a consistent evolution. We also devise a novel multi-shot fusion strategy, which takes the misalignment among support samples into consideration. Ablation studies and visualizations demonstrate the role played by both stages in addressing the misalignment. Extensive experiments on benchmark datasets show the potential of the proposed method in achieving state-of-the-art performance for few-shot action recognition.
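The two stages named in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names, the (scale, shift) parameterisation of the affine warp, the linear interpolation, and the projection-free dot-product attention are all assumptions made for illustration. In TTAN the warp parameters are predicted by the network, whereas here they are passed in by hand.

```python
import numpy as np


def temporal_affine_warp(feats, scale, shift):
    """Stage 1 (sketch): resample a (T, D) frame-feature sequence along time.

    Sampling positions are scale * t + shift on a time grid normalised to
    [-1, 1], with linear interpolation and clamping at the sequence borders.
    The (scale, shift) pair is a hypothetical stand-in for predicted parameters.
    """
    T = feats.shape[0]
    grid = np.linspace(-1.0, 1.0, T)
    src = np.clip(scale * grid + shift, -1.0, 1.0)
    pos = (src + 1.0) * 0.5 * (T - 1)  # back to frame indices in [0, T-1]
    lo = np.floor(pos).astype(int)
    hi = np.minimum(lo + 1, T - 1)
    w = (pos - lo)[:, None]
    return (1.0 - w) * feats[lo] + w * feats[hi]


def cross_attention_align(query, support):
    """Stage 2 (sketch): rebuild the support sequence on the query's timeline.

    Each query frame attends over all support frames, so row i of the output
    is an attention-weighted mixture of support frames for query frame i.
    No learned projections are used here; that is a simplification.
    """
    d = query.shape[1]
    scores = query @ support.T / np.sqrt(d)
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)
    return attn @ support, attn


def aligned_distance(query, support, q_params=(1.0, 0.0), s_params=(1.0, 0.0)):
    """Mean squared frame distance after both alignment steps."""
    q = temporal_affine_warp(query, *q_params)
    s = temporal_affine_warp(support, *s_params)
    s_aligned, _ = cross_attention_align(q, s)
    return float(np.mean(np.sum((q - s_aligned) ** 2, axis=1)))
```

In a metric-learning setup, a query video would be assigned to the support class with the smallest such distance. With several support samples per class, the per-sample results would still need to be fused, which this sketch does not attempt.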
