We present a novel method for few-shot video classification that performs both appearance and temporal alignments. In particular, given a pair of query and support videos, we perform appearance alignment via frame-level feature matching to obtain an appearance similarity score between the videos, while exploiting temporal order-preserving priors to obtain a temporal similarity score between them. Moreover, we introduce a few-shot video classification framework that leverages these appearance and temporal similarity scores across multiple stages, namely prototype-based training and testing as well as inductive and transductive prototype refinement. To the best of our knowledge, our work is the first to explore transductive few-shot video classification. Extensive experiments on both the Kinetics and Something-Something V2 datasets show that appearance and temporal alignments are both crucial for datasets that are sensitive to temporal order, such as Something-Something V2. Our approach achieves results similar to or better than those of previous methods on both datasets. Our code is available at https://github.com/VinAIResearch/fsvc-ata.
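To make the two similarity scores concrete, the following is a minimal sketch in PyTorch, assuming per-frame features have already been extracted by a backbone. The function names, the max-matching aggregation, and the Gaussian order-preserving prior are illustrative assumptions, not the authors' exact formulation.

```python
# Illustrative sketch of frame-level appearance matching and an
# order-preserving temporal similarity; names and priors are hypothetical.
import torch
import torch.nn.functional as F


def appearance_similarity(query_feats: torch.Tensor, support_feats: torch.Tensor) -> torch.Tensor:
    """Frame-level feature matching: each query frame is matched to its most
    similar support frame, and the matching scores are averaged.

    query_feats:   (T_q, D) per-frame features of the query video
    support_feats: (T_s, D) per-frame features of the support video
    """
    q = F.normalize(query_feats, dim=-1)
    s = F.normalize(support_feats, dim=-1)
    sim = q @ s.t()                       # (T_q, T_s) cosine similarities
    return sim.max(dim=1).values.mean()   # best support match per query frame, averaged


def temporal_similarity(query_feats: torch.Tensor, support_feats: torch.Tensor) -> torch.Tensor:
    """Temporal order-preserving prior: frame pairs whose normalized temporal
    positions are close contribute more, down-weighting out-of-order matches.
    The Gaussian weighting over index differences is an illustrative choice.
    """
    q = F.normalize(query_feats, dim=-1)
    s = F.normalize(support_feats, dim=-1)
    sim = q @ s.t()                                       # (T_q, T_s)
    pos_q = torch.linspace(0, 1, q.shape[0]).unsqueeze(1)  # (T_q, 1)
    pos_s = torch.linspace(0, 1, s.shape[0]).unsqueeze(0)  # (1, T_s)
    prior = torch.exp(-((pos_q - pos_s) ** 2) / 0.1)       # order-preserving weights
    return (prior * sim).sum() / prior.sum()


if __name__ == "__main__":
    q = torch.randn(8, 512)  # 8 query frames, 512-d features
    s = torch.randn(8, 512)  # 8 support frames
    print(appearance_similarity(q, s).item(), temporal_similarity(q, s).item())
```

In a prototype-based episode, scores of this kind would be computed between the query and each class prototype, and the combined appearance and temporal scores would drive both classification and prototype refinement; the exact combination and refinement rules are described in the paper.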