Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhixiang Shi

Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Mar 09, 2020

Jialin Gao, Zhixiang Shi, Jiani Li, Guanshuo Wang, Yufeng Yuan, Shiming Ge, Xi Zhou

Figure 1 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Figure 2 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Figure 3 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Figure 4 for Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network

Abstract:Accurate temporal action proposals play an important role in detecting actions from untrimmed videos. The existing approaches have difficulties in capturing global contextual information and simultaneously localizing actions with different durations. To this end, we propose a Relation-aware pyramid Network (RapNet) to generate highly accurate temporal action proposals. In RapNet, a novel relation-aware module is introduced to exploit bi-directional long-range relations between local features for context distilling. This embedded module enhances the RapNet in terms of its multi-granularity temporal proposal generation ability, given predefined anchor boxes. We further introduce a two-stage adjustment scheme to refine the proposal boundaries and measure their confidence in containing an action with snippet-level actionness. Extensive experiments on the challenging ActivityNet and THUMOS14 benchmarks demonstrate our RapNet generates superior accurate proposals over the existing state-of-the-art methods.

* accepted by AAAI-20

Via

Access Paper or Ask Questions

Relation-Aware Pyramid Network (RapNet) for temporal action proposal

Aug 09, 2019

Jialin Gao, Zhixiang Shi, Jiani Li, Yufeng Yuan, Jiwei Li, Xi Zhou

Figure 1 for Relation-Aware Pyramid Network (RapNet) for temporal action proposal

Figure 2 for Relation-Aware Pyramid Network (RapNet) for temporal action proposal

Figure 3 for Relation-Aware Pyramid Network (RapNet) for temporal action proposal

Abstract:In this technical report, we describe our solution to temporal action proposal (task 1) in ActivityNet Challenge 2019. First, we fine-tune a ResNet-50-C3D CNN on ActivityNet v1.3 based on Kinetics pretrained model to extract snippet-level video representations and then we design a Relation-Aware Pyramid Network (RapNet) to generate temporal multiscale proposals with confidence score. After that, we employ a two-stage snippet-level boundary adjustment scheme to re-rank the order of generated proposals. Ensemble methods are also been used to improve the performance of our solution, which helps us achieve 2nd place.

* Submission to temporal action proposal task in ActivityNet Challenge 2019

Via

Access Paper or Ask Questions