Title: Towards Complex-query Referring Image Segmentation: A Novel Benchmark

Sep 29, 2023

Wei Ji, Li Li, Hao Fei, Xiangyan Liu, Xun Yang, Juncheng Li, Roger Zimmermann

Abstract: Referring Image Segmentation (RIS) has been extensively studied over the past decade, leading to the development of advanced algorithms. However, little research has investigated how existing algorithms should be benchmarked with complex language queries, which include more informative descriptions of surrounding objects and backgrounds (e.g., "the black car." vs. "the black car is parking on the road and beside the bus."). Given the significant improvement in the semantic understanding capability of large pre-trained models, it is crucial to take a step further in RIS by incorporating complex language that resembles real-world applications. To close this gap, building upon the existing RefCOCO and Visual Genome datasets, we propose a new RIS benchmark with complex queries, namely RIS-CQ. The RIS-CQ dataset is of high quality and large scale; it challenges existing RIS methods with enriched, specific, and informative queries, and enables a more realistic scenario for RIS research. In addition, we present a niche-targeting method to better tackle RIS-CQ, called the dual-modality graph alignment model (DuMoGa), which outperforms a series of RIS methods.
