Title: Self-supervised Learning of Contextualized Local Visual Embeddings

Oct 04, 2023

Thalles Santos Silva, Helio Pedrini, Adín Ramírez Rivera

Abstract: We present Contextualized Local Visual Embeddings (CLoVE), a self-supervised convolution-based method that learns representations suited for dense prediction tasks. CLoVE deviates from current methods and optimizes a single loss function that operates at the level of contextualized local embeddings learned from the output feature maps of convolutional neural network (CNN) encoders. To learn contextualized embeddings, CLoVE proposes a normalized multi-head self-attention layer that combines local features from different parts of an image based on similarity. We extensively benchmark CLoVE's pre-trained representations on multiple datasets. CLoVE reaches state-of-the-art performance for CNN-based architectures in 4 dense prediction downstream tasks: object detection, instance segmentation, keypoint detection, and dense pose estimation.
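To make the idea of combining local features by similarity concrete, here is a minimal NumPy sketch of a generic multi-head self-attention layer applied to the spatial locations of a CNN feature map, with L2-normalized queries and keys so that attention weights depend on cosine similarity. This is an illustration under stated assumptions, not the authors' layer: the function name `normalized_mhsa`, the projection matrices `w_q`, `w_k`, `w_v`, the `temperature` value, and the choice to normalize queries and keys are all hypothetical. The exact formulation is in the paper and the linked repository.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-8):
    # Scale vectors to unit length so dot products become cosine similarities.
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def normalized_mhsa(feature_map, w_q, w_k, w_v, num_heads, temperature=0.1):
    """Generic sketch (assumed formulation, not the paper's exact layer).

    feature_map: (H, W, C) output of a CNN encoder.
    w_q, w_k, w_v: (C, C) projection matrices (hypothetical parameters).
    Returns contextualized local embeddings (H, W, C) and the
    attention weights (num_heads, H*W, H*W).
    """
    h, w, c = feature_map.shape
    assert c % num_heads == 0, "channels must divide evenly across heads"
    d = c // num_heads
    tokens = feature_map.reshape(h * w, c)  # one local feature per location

    def split_heads(x):
        # (N, C) -> (num_heads, N, d)
        return x.reshape(h * w, num_heads, d).transpose(1, 0, 2)

    q = l2_normalize(split_heads(tokens @ w_q))
    k = l2_normalize(split_heads(tokens @ w_k))
    v = split_heads(tokens @ w_v)

    # Per-head cosine similarity between every pair of locations.
    attn = softmax(q @ k.transpose(0, 2, 1) / temperature, axis=-1)

    # Each output location is a similarity-weighted mix of all local features.
    out = (attn @ v).transpose(1, 0, 2).reshape(h, w, c)
    return out, attn


rng = np.random.default_rng(0)
fmap = rng.normal(size=(7, 7, 64))  # stand-in for an encoder feature map
w_q, w_k, w_v = (rng.normal(size=(64, 64)) / 8.0 for _ in range(3))
embeddings, attn = normalized_mhsa(fmap, w_q, w_k, w_v, num_heads=4)
print(embeddings.shape, attn.shape)
```

In this sketch, each of the 49 spatial locations receives an embedding that mixes features from the other locations in proportion to their similarity, which is one plausible reading of "contextualized local embeddings".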

* Pre-print. 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop, ICCV 2023. Code at https://github.com/sthalles/CLoVE
