Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

Oct 11, 2022

Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam

Figure 1 for CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

Figure 2 for CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

Figure 3 for CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

Figure 4 for CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

Share this with someone who'll enjoy it:

Abstract:We propose CLIP-Fields, an implicit scene model that can be trained with no direct human supervision. This model learns a mapping from spatial locations to semantic embedding vectors. The mapping can then be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization. Most importantly, the mapping can be trained with supervision coming only from web-image and web-text trained models such as CLIP, Detic, and Sentence-BERT. When compared to baselines like Mask-RCNN, our method outperforms on few-shot instance identification or semantic segmentation on the HM3D dataset with only a fraction of the examples. Finally, we show that using CLIP-Fields as a scene memory, robots can perform semantic navigation in real-world environments. Our code and demonstrations are available here: https://mahis.life/clip-fields/

* Code, video, and interactive demonstrations available at https://mahis.life/clip-fields/

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

Paper and Code