Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sophia Sirko-Galouchenko

OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Apr 22, 2024

Sophia Sirko-Galouchenko, Alexandre Boulch, Spyros Gidaris, Andrei Bursuc, Antonin Vobecky, Patrick Pérez, Renaud Marlet

Figure 1 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Figure 2 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Figure 3 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Figure 4 for OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Abstract:We introduce a self-supervised pretraining method, called OcFeat, for camera-only Bird's-Eye-View (BEV) segmentation networks. With OccFeat, we pretrain a BEV network via occupancy prediction and feature distillation tasks. Occupancy prediction provides a 3D geometric understanding of the scene to the model. However, the geometry learned is class-agnostic. Hence, we add semantic information to the model in the 3D space through distillation from a self-supervised pretrained image foundation model. Models pretrained with our method exhibit improved BEV semantic segmentation performance, particularly in low-data scenarios. Moreover, empirical results affirm the efficacy of integrating feature distillation with 3D occupancy prediction in our pretraining approach.

Via

Access Paper or Ask Questions