Title: Long Short-Term Transformer for Online Action Detection

Jul 07, 2021

Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Xia, Zhuowen Tu, Stefano Soatto

Abstract: In this paper, we present Long Short-term TRansformer (LSTR), a new temporal modeling algorithm for online action detection that employs a long- and short-term memory mechanism to model prolonged sequence data. It consists of an LSTR encoder that dynamically exploits coarse-scale historical information from an extended time window (e.g., 2048 long-range frames of up to 8 minutes), together with an LSTR decoder that focuses on a short time window (e.g., 32 short-range frames of 8 seconds) to model the fine-scale characteristics of the ongoing event. Compared to prior work, LSTR provides an effective and efficient method for modeling long videos with less heuristic algorithm design. LSTR achieves significantly improved results over existing state-of-the-art approaches on the standard online action detection benchmarks THUMOS'14, TVSeries, and HACS Segment. Extensive empirical analysis validates the setup of the long- and short-term memories and the design choices of LSTR.
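The split between a long-range history and a short recent window can be illustrated with a small streaming buffer. This is a minimal sketch, not the authors' implementation: the class name `LongShortMemory`, its methods, and the assumption that the long-term memory holds the frames immediately preceding the short-term window are all hypothetical. Only the example window sizes (2048 and 32 frames) come from the abstract. The encoder and decoder themselves are not modeled here.

```python
from collections import deque


class LongShortMemory:
    """Hypothetical FIFO buffer that splits a frame stream into two windows.

    Assumption (not confirmed by the abstract): the long-term memory holds
    the frames that immediately precede the short-term window.
    """

    def __init__(self, long_len=2048, short_len=32):
        if long_len <= 0 or short_len <= 0:
            raise ValueError("window lengths must be positive")
        self.long_len = long_len
        self.short_len = short_len
        # Oldest frames are evicted automatically once the buffer is full.
        self._buf = deque(maxlen=long_len + short_len)

    def push(self, feature):
        """Append the feature of the newest frame."""
        self._buf.append(feature)

    def short_term(self):
        """Most recent frames (input to a decoder in this sketch)."""
        return list(self._buf)[-self.short_len:]

    def long_term(self):
        """Older frames before the short window (input to an encoder)."""
        return list(self._buf)[:-self.short_len]


# Usage with tiny windows so the behavior is easy to inspect.
mem = LongShortMemory(long_len=4, short_len=2)
for frame_index in range(10):
    mem.push(frame_index)
recent = mem.short_term()
history = mem.long_term()
```

With these small windows, `recent` holds the two newest frame indices and `history` holds the four frames before them. Until more than `short_len` frames have arrived, `long_term()` returns an empty list.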

* Technical report

View paper on OpenReview
