Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Aug 02, 2022

Junru Gu, Chenxu Hu, Tianyuan Zhang, Xuanyao Chen, Yilun Wang, Yue Wang, Hang Zhao

Figure 1 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Figure 2 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Figure 3 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Figure 4 for ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Share this with someone who'll enjoy it:

Abstract:Existing autonomous driving pipelines separate the perception module from the prediction module. The two modules communicate via hand-picked features such as agent boxes and trajectories as interfaces. Due to this separation, the prediction module only receives partial information from the perception module. Even worse, errors from the perception modules can propagate and accumulate, adversely affecting the prediction results. In this work, we propose ViP3D, a visual trajectory prediction pipeline that leverages the rich information from raw videos to predict future trajectories of agents in a scene. ViP3D employs sparse agent queries throughout the pipeline, making it fully differentiable and interpretable. Furthermore, we propose an evaluation metric for this novel end-to-end visual trajectory prediction task. Extensive experimental results on the nuScenes dataset show the strong performance of ViP3D over traditional pipelines and previous end-to-end models.

* Project page is at https://tsinghua-mars-lab.github.io/ViP3D

View paper on

Share this with someone who'll enjoy it:

Title:ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Paper and Code