Title: From Occlusion to Insight: Object Search in Semantic Shelves using Large Language Models

Feb 24, 2023

Satvik Sharma, Kaushik Shivakumar, Huang Huang, Ryan Hoque, Alishba Imran, Brian Ichter, Ken Goldberg

Abstract: How can a robot efficiently extract a desired object from a shelf when it is fully occluded by other objects? Prior works propose geometric approaches for this problem but do not consider object semantics. Shelves in pharmacies, restaurant kitchens, and grocery stores are often organized such that semantically similar objects are placed close to one another. Can large language models (LLMs) serve as semantic knowledge sources to accelerate robotic mechanical search in semantically arranged environments? With Semantic Spatial Search on Shelves (S^4), we use LLMs to generate affinity matrices, where each entry corresponds to the semantic likelihood of physical proximity between a pair of objects. We derive semantic spatial distributions by synthesizing semantics with learned geometric constraints. S^4 incorporates Optical Character Recognition (OCR) and semantic refinement with predictions from ViLD, an open-vocabulary object detection model. Simulation experiments suggest that semantic spatial search reduces search time relative to pure spatial search by an average of 24% across three domains: pharmacy, kitchen, and office shelves. A manually collected dataset of 100 semantic scenes suggests that OCR and semantic refinement improve object detection accuracy by 35%. Lastly, physical experiments on a pharmacy shelf suggest a 47.1% improvement over pure spatial search. Supplementary material can be found at https://sites.google.com/view/s4-rss/home.
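
To make the idea of combining semantic affinities with a geometric prior concrete, here is a minimal sketch. It is not the S^4 implementation. Everything in it is an assumption made for illustration: the one-dimensional shelf split into bins, the Gaussian distance kernel, the multiplicative combination, the function name `semantic_spatial_distribution`, and the hand-written affinity values, which stand in for one row of an LLM-generated affinity matrix.

```python
import math


def semantic_spatial_distribution(geometric_prior, bin_centers,
                                  visible_objects, affinity, sigma=0.1):
    """Reweight a geometric prior over shelf bins by semantic affinity.

    geometric_prior: non-negative weight per bin (hypothetical output of a
                     purely geometric occupancy model).
    bin_centers:     x-coordinate of each bin along the shelf.
    visible_objects: list of (name, x_position) for detected objects.
    affinity:        dict mapping object name -> affinity in [0, 1] between
                     the hidden target and that object (assumed values).
    sigma:           width of the distance kernel (assumed, in shelf units).
    """
    if sum(geometric_prior) <= 0:
        raise ValueError("geometric prior must have positive mass")

    scores = []
    for prior, center in zip(geometric_prior, bin_centers):
        semantic = 0.0
        for name, x in visible_objects:
            # Bins close to a semantically related object get more weight.
            kernel = math.exp(-((center - x) ** 2) / (2 * sigma ** 2))
            semantic += affinity[name] * kernel
        scores.append(prior * semantic)

    total = sum(scores)
    if total == 0:
        # No semantic signal: fall back to the normalized geometric prior.
        prior_total = sum(geometric_prior)
        return [p / prior_total for p in geometric_prior]
    return [s / total for s in scores]


# Hypothetical pharmacy shelf: the target is aspirin, fully occluded.
bin_centers = [0.1, 0.3, 0.5, 0.7, 0.9]
geometric_prior = [0.2, 0.2, 0.2, 0.2, 0.2]
visible_objects = [("ibuprofen", 0.1), ("shampoo", 0.9)]
affinity = {"ibuprofen": 0.9, "shampoo": 0.1}  # assumed LLM-derived values

dist = semantic_spatial_distribution(
    geometric_prior, bin_centers, visible_objects, affinity
)
best_bin = max(range(len(dist)), key=lambda i: dist[i])
print(best_bin, [round(p, 3) for p in dist])
```

In this toy scene the geometric prior is uniform, so the ranking of bins comes entirely from the semantic term, and the search would start next to the ibuprofen. With a non-uniform prior, the product would also suppress bins where the target cannot physically fit.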
