Image search engines enable the retrieval of images relevant to a query image. In this work, we consider the setting where a query for similar images is derived from a collection of images. In visual search, similarity may be measured along multiple axes, or views, such as style and color. We assume access to a set of feature extractors, each of which computes representations for a specific view. Our objective is to design a retrieval algorithm that effectively combines similarities computed over representations from multiple views. To this end, we propose a self-supervised learning method that extracts disentangled view-specific representations for images such that inter-view overlap is minimized. We show how this allows us to compute the intent of a collection as a distribution over views, and how effective retrieval can be performed by prioritizing candidate expansion images that match the intent of a query collection. Finally, we present a new querying mechanism for image search in which multiple collections are composed, and we perform retrieval under this setting using the techniques presented in this paper.
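To make the intent-weighted retrieval idea concrete, the following is a minimal sketch, not the paper's exact formulation: it assumes L2-normalized, view-specific embeddings are already available per image, estimates a collection's intent as a softmax over the mean within-collection similarity in each view, and scores candidates by intent-weighted view similarities. The function names, the temperature, and the agreement heuristic are illustrative assumptions.

```python
import numpy as np

def intent_distribution(collection_embs, temperature=0.1):
    """Estimate a collection's intent as a distribution over views.

    collection_embs: dict mapping view name -> (n_images, dim) array of
    L2-normalized view-specific embeddings for the query collection.
    Views in which the collection agrees more strongly receive more mass.
    (Heuristic sketch; not the paper's exact definition of intent.)
    """
    scores = {}
    for view, embs in collection_embs.items():
        sims = embs @ embs.T                           # pairwise cosine similarities
        n = len(embs)
        scores[view] = (sims.sum() - np.trace(sims)) / (n * (n - 1))  # mean off-diagonal similarity
    views = list(scores)
    logits = np.array([scores[v] for v in views]) / temperature
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return dict(zip(views, probs))

def score_candidate(candidate_embs, collection_embs, intent):
    """Intent-weighted similarity of one candidate image to the collection."""
    total = 0.0
    for view, weight in intent.items():
        sims = collection_embs[view] @ candidate_embs[view]  # (n_images,)
        total += weight * sims.mean()
    return total

# Example usage with random vectors standing in for real view-specific embeddings.
rng = np.random.default_rng(0)
def rand_unit(n, d):
    x = rng.normal(size=(n, d))
    return x / np.linalg.norm(x, axis=1, keepdims=True)

collection = {"style": rand_unit(5, 64), "color": rand_unit(5, 64)}
candidate = {"style": rand_unit(1, 64)[0], "color": rand_unit(1, 64)[0]}
intent = intent_distribution(collection)
print(intent, score_candidate(candidate, collection, intent))
```

In this sketch, candidate expansion images whose similarity is concentrated in the views the collection "cares about" are ranked higher, which mirrors the idea of prioritizing candidates that match the query collection's intent.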