Title: TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking

Apr 03, 2021

Peng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu

Abstract: Tracking multiple objects in videos relies on modeling the spatial-temporal interactions of the objects. In this paper, we propose a solution named TransMOT, which leverages powerful graph transformers to efficiently model the spatial and temporal interactions among the objects. TransMOT effectively models the interactions of a large number of objects by arranging the trajectories of the tracked objects as a set of sparse weighted graphs, and constructing a spatial graph transformer encoder layer, a temporal transformer encoder layer, and a spatial graph transformer decoder layer based on the graphs. TransMOT is not only more computationally efficient than the traditional Transformer, but it also achieves better tracking accuracy. To further improve the tracking speed and accuracy, we propose a cascade association framework to handle low-score detections and long-term occlusions that require large computational resources to model in TransMOT. The proposed method is evaluated on multiple benchmark datasets including MOT15, MOT16, MOT17, and MOT20, and it achieves state-of-the-art performance on all the datasets.
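
The idea of arranging tracked objects as a sparse weighted graph and restricting attention to graph neighbours can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how edges are weighted or how the weights enter the attention, so the use of box IoU as the edge weight, the multiplicative weighting of attention scores, and the helper names `iou`, `build_sparse_graph`, and `graph_attention` are all assumptions made for illustration.

```python
from math import exp


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def build_sparse_graph(boxes):
    """Adjacency dict {i: {j: weight}} for one frame.

    Assumption: an edge exists only between overlapping boxes and is
    weighted by their IoU; every node also keeps a self-loop of weight 1.
    """
    graph = {i: {i: 1.0} for i in range(len(boxes))}
    for i in range(len(boxes)):
        for j in range(i + 1, len(boxes)):
            w = iou(boxes[i], boxes[j])
            if w > 0:
                graph[i][j] = w
                graph[j][i] = w
    return graph


def graph_attention(features, graph):
    """Single-head dot-product attention restricted to graph neighbours.

    Assumption: each edge weight multiplies the exponentiated score before
    normalisation, so non-neighbours contribute nothing at all.
    """
    d = len(features[0])
    out = []
    for i, nbrs in graph.items():
        scores = {
            j: w * exp(sum(p * q for p, q in zip(features[i], features[j])) / d ** 0.5)
            for j, w in nbrs.items()
        }
        z = sum(scores.values())
        out.append([sum(scores[j] / z * features[j][k] for j in scores) for k in range(d)])
    return out


# Two overlapping boxes and one isolated box, with toy 2-D features.
boxes = [(0, 0, 10, 10), (5, 5, 15, 15), (100, 100, 110, 110)]
features = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
graph = build_sparse_graph(boxes)
attended = graph_attention(features, graph)
```

Because each object attends only to its neighbours, the cost of this sketch grows with the number of edges rather than with the square of the number of objects, which is the kind of saving the abstract attributes to the sparse graph formulation.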
