Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Dec 15, 2020

Chen Ju, Peisen Zhao, Ya Zhang, Yanfeng Wang, Qi Tian

Figure 1 for Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Figure 2 for Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Figure 3 for Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Figure 4 for Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Share this with someone who'll enjoy it:

Abstract:Point-Level temporal action localization (PTAL) aims to localize actions in untrimmed videos with only one timestamp annotation for each action instance. Existing methods adopt the frame-level prediction paradigm to learn from the sparse single-frame labels. However, such a framework inevitably suffers from a large solution space. This paper attempts to explore the proposal-based prediction paradigm for point-level annotations, which has the advantage of more constrained solution space and consistent predictions among neighboring frames. The point-level annotations are first used as the keypoint supervision to train a keypoint detector. At the location prediction stage, a simple but effective mapper module, which enables back-propagation of training errors, is then introduced to bridge the fully-supervised framework with weak supervision. To our best of knowledge, this is the first work to leverage the fully-supervised paradigm for the point-level setting. Experiments on THUMOS14, BEOID, and GTEA verify the effectiveness of our proposed method both quantitatively and qualitatively, and demonstrate that our method outperforms state-of-the-art methods.

View paper on

Share this with someone who'll enjoy it:

Title:Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses

Paper and Code