Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jishuo Li

OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Nov 26, 2024

Zhongyu Xia, Jishuo Li, Zhiwei Lin, Xinhao Wang, Yongtao Wang, Ming-Hsuan Yang

Figure 1 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Figure 2 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Figure 3 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Figure 4 for OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

Abstract:Open-world autonomous driving encompasses domain generalization and open-vocabulary. Domain generalization refers to the capabilities of autonomous driving systems across different scenarios and sensor parameter configurations. Open vocabulary pertains to the ability to recognize various semantic categories not encountered during training. In this paper, we introduce OpenAD, the first real-world open-world autonomous driving benchmark for 3D object detection. OpenAD is built on a corner case discovery and annotation pipeline integrating with a multimodal large language model (MLLM). The proposed pipeline annotates corner case objects in a unified format for five autonomous driving perception datasets with 2000 scenarios. In addition, we devise evaluation methodologies and evaluate various 2D and 3D open-world and specialized models. Moreover, we propose a vision-centric 3D open-world object detection baseline and further introduce an ensemble method by fusing general and specialized models to address the issue of lower precision in existing open-world methods for the OpenAD benchmark. Annotations, toolkit code, and all evaluation codes will be released.

Via

Access Paper or Ask Questions