Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye View

Sep 12, 2023

Yuxiang Yang, Yingqi Deng, Jiahao Nie, Jing Zhang

Figure 1 for BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye View

Figure 2 for BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye View

Figure 3 for BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye View

Figure 4 for BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye View

Share this with someone who'll enjoy it:

Abstract:3D single object tracking (SOT) in point clouds is still a challenging problem due to appearance variation, distractors, and high sparsity of point clouds. Notably, in autonomous driving scenarios, the target object typically maintains spatial adjacency across consecutive frames, predominantly moving horizontally. This spatial continuity offers valuable prior knowledge for target localization. However, existing trackers, which often employ point-wise representations, struggle to efficiently utilize this knowledge owing to the irregular format of such representations. Consequently, they require elaborate designs and solving multiple subtasks to establish spatial correspondence. In this paper, we introduce BEVTrack, a simple yet strong baseline framework for 3D SOT. After converting consecutive point clouds into the common Bird's-Eye View representation, BEVTrack inherently encodes spatial proximity and adeptly captures motion cues for tracking via a simple element-wise operation and convolutional layers. Additionally, to better deal with objects having diverse sizes and moving patterns, BEVTrack directly learns the underlying motion distribution rather than making a fixed Laplacian or Gaussian assumption as in previous works. Without bells and whistles, BEVTrack achieves state-of-the-art performance on KITTI and NuScenes datasets while maintaining a high inference speed of 122 FPS. The code will be released at https://github.com/xmm-prio/BEVTrack.

* Technical report. Work in progress. Typo correction. The code will be released at https://github.com/xmm-prio/BEVTrack

View paper on

Share this with someone who'll enjoy it:

Title:BEVTrack: A Simple Baseline for 3D Single Object Tracking in Bird's-Eye View

Paper and Code