Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning

Mar 29, 2022

Liulei Li, Tianfei Zhou, Wenguan Wang, Lu Yang, Jianwu Li, Yi Yang

Figure 1 for Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning

Figure 2 for Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning

Figure 3 for Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning

Figure 4 for Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning

Share this with someone who'll enjoy it:

Abstract:Our target is to learn visual correspondence from unlabeled videos. We develop LIIR, a locality-aware inter-and intra-video reconstruction framework that fills in three missing pieces, i.e., instance discrimination, location awareness, and spatial compactness, of self-supervised correspondence learning puzzle. First, instead of most existing efforts focusing on intra-video self-supervision only, we exploit cross video affinities as extra negative samples within a unified, inter-and intra-video reconstruction scheme. This enables instance discriminative representation learning by contrasting desired intra-video pixel association against negative inter-video correspondence. Second, we merge position information into correspondence matching, and design a position shifting strategy to remove the side-effect of position encoding during inter-video affinity computation, making our LIIR location-sensitive. Third, to make full use of the spatial continuity nature of video data, we impose a compactness-based constraint on correspondence matching, yielding more sparse and reliable solutions. The learned representation surpasses self-supervised state-of-the-arts on label propagation tasks including objects, semantic parts, and keypoints.

* CVPR 2022. Code: https://github.com/0liliulei/LIIR

View paper on

Share this with someone who'll enjoy it:

Title:Locality-Aware Inter-and Intra-Video Reconstruction for Self-Supervised Correspondence Learning

Paper and Code