Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:LidarCLIP or: How I Learned to Talk to Point Clouds

Dec 13, 2022

Georg Hess, Adam Tonderski, Christoffer Petersson, Lennart Svensson, Kalle Åström

Figure 1 for LidarCLIP or: How I Learned to Talk to Point Clouds

Figure 2 for LidarCLIP or: How I Learned to Talk to Point Clouds

Figure 3 for LidarCLIP or: How I Learned to Talk to Point Clouds

Figure 4 for LidarCLIP or: How I Learned to Talk to Point Clouds

Share this with someone who'll enjoy it:

Abstract:Research connecting text and images has recently seen several breakthroughs, with models like CLIP, DALL-E 2, and Stable Diffusion. However, the connection between text and other visual modalities, such as lidar data, has received less attention, prohibited by the lack of text-lidar datasets. In this work, we propose LidarCLIP, a mapping from automotive point clouds to a pre-existing CLIP embedding space. Using image-lidar pairs, we supervise a point cloud encoder with the image CLIP embeddings, effectively relating text and lidar data with the image domain as an intermediary. We show the effectiveness of LidarCLIP by demonstrating that lidar-based retrieval is generally on par with image-based retrieval, but with complementary strengths and weaknesses. By combining image and lidar features, we improve upon both single-modality methods and enable a targeted search for challenging detection scenarios under adverse sensor conditions. We also use LidarCLIP as a tool to investigate fundamental lidar capabilities through natural language. Finally, we leverage our compatibility with CLIP to explore a range of applications, such as point cloud captioning and lidar-to-image generation, without any additional training. We hope LidarCLIP can inspire future work to dive deeper into connections between text and point cloud understanding. Code and trained models available at https://github.com/atonderski/lidarclip.

View paper on

Share this with someone who'll enjoy it:

Title:LidarCLIP or: How I Learned to Talk to Point Clouds

Paper and Code