Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:VISIR: Visual and Semantic Image Label Refinement

Sep 02, 2019

Sreyasi Nag Chowdhury, Niket Tandon, Hakan Ferhatosmanoglu, Gerhard Weikum

Figure 1 for VISIR: Visual and Semantic Image Label Refinement

Figure 2 for VISIR: Visual and Semantic Image Label Refinement

Figure 3 for VISIR: Visual and Semantic Image Label Refinement

Figure 4 for VISIR: Visual and Semantic Image Label Refinement

Share this with someone who'll enjoy it:

Abstract:The social media explosion has populated the Internet with a wealth of images. There are two existing paradigms for image retrieval: 1) content-based image retrieval (CBIR), which has traditionally used visual features for similarity search (e.g., SIFT features), and 2) tag-based image retrieval (TBIR), which has relied on user tagging (e.g., Flickr tags). CBIR now gains semantic expressiveness by advances in deep-learning-based detection of visual labels. TBIR benefits from query-and-click logs to automatically infer more informative labels. However, learning-based tagging still yields noisy labels and is restricted to concrete objects, missing out on generalizations and abstractions. Click-based tagging is limited to terms that appear in the textual context of an image or in queries that lead to a click. This paper addresses the above limitations by semantically refining and expanding the labels suggested by learning-based object detection. We consider the semantic coherence between the labels for different objects, leverage lexical and commonsense knowledge, and cast the label assignment into a constrained optimization problem solved by an integer linear program. Experiments show that our method, called VISIR, improves the quality of the state-of-the-art visual labeling tools like LSDA and YOLO.

* ACM ISBN 978-1-4503-5581-0/18/02 2018 * Published in WSDM 2018

View paper on

Share this with someone who'll enjoy it:

Title:VISIR: Visual and Semantic Image Label Refinement

Paper and Code