Title: TAPTRv2: Attention-based Position Update Improves Tracking Any Point

Jul 23, 2024

Hongyang Li, Hao Zhang, Shilong Liu, Zhaoyang Zeng, Feng Li, Tianhe Ren, Bohan Li, Lei Zhang

Abstract: In this paper, we present TAPTRv2, a Transformer-based approach built upon TAPTR for solving the Tracking Any Point (TAP) task. TAPTR borrows designs from DEtection TRansformer (DETR) and formulates each tracking point as a point query, making it possible to leverage well-studied operations in DETR-like algorithms. TAPTRv2 improves TAPTR by addressing a critical issue regarding its reliance on cost-volume, which contaminates the point query's content feature and negatively impacts both visibility prediction and cost-volume computation. In TAPTRv2, we propose a novel attention-based position update (APU) operation and use key-aware deformable attention to realize it. For each query, this operation uses key-aware attention weights to combine the query's corresponding deformable sampling positions and predict a new query position. This design is based on the observation that local attention is essentially the same as cost-volume, since both are computed as dot products between a query and its surrounding features. By introducing this new operation, TAPTRv2 not only removes the extra burden of cost-volume computation, but also leads to a substantial performance improvement. TAPTRv2 surpasses TAPTR and achieves state-of-the-art performance on many challenging datasets, demonstrating the superiority
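
The position update described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `attention_position_update`, the single-head form, the scaled dot-product scoring, and the softmax normalization are all assumptions made for illustration. The abstract states only that key-aware attention weights are used to combine the deformable sampling positions into a new query position.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_position_update(query, keys, positions):
    """Hypothetical single-head sketch of an attention-based position update.

    query:     content feature of one point query (list of d floats)
    keys:      features sampled at K positions (K lists of d floats)
    positions: the K sampling positions as (x, y) tuples

    Assumption: weights come from a scaled dot product between the query
    and each sampled key, followed by a softmax.
    """
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
        for key in keys
    ]
    weights = softmax(scores)
    # The new position is the attention-weighted average of the sampling positions.
    new_x = sum(w * p[0] for w, p in zip(weights, positions))
    new_y = sum(w * p[1] for w, p in zip(weights, positions))
    return (new_x, new_y), weights


# Toy example: two sampled keys match the query, one does not.
query = [1.0, 0.0]
keys = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
positions = [(10.0, 20.0), (12.0, 20.0), (30.0, 40.0)]
new_pos, weights = attention_position_update(query, keys, positions)
```

In this toy setup the updated position is pulled toward the two sampling positions whose features resemble the query. The dot-product scores play the role that a local cost-volume would otherwise play, which is the equivalence the abstract points out.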
