Paul Misterka

Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models

Aug 24, 2023

Jan Warchocki, Teodor Oprescu, Yunhan Wang, Alexandru Damacus, Paul Misterka, Robert-Jan Bruintjes, Attila Lengyel, Ombretta Strafforello, Jan van Gemert

Abstract: In temporal action localization, given an input video, the goal is to predict which actions it contains, where they begin, and where they end. Training and testing current state-of-the-art deep learning models requires access to large amounts of data and computational power. However, gathering such data is challenging and computational resources might be limited. This work explores and measures how current deep temporal action localization models perform in settings constrained by the amount of data or computational power. We measure data efficiency by training each model on a subset of the training set. We find that TemporalMaxer outperforms other models in data-limited settings. Furthermore, we recommend TriDet when training time is limited. To test the efficiency of the models during inference, we pass videos of different lengths through each model. We find that TemporalMaxer requires the least computational resources, likely due to its simple architecture.
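
The two measurements described in the abstract (training on a subset of the training set, and running inference on inputs of different lengths) could be sketched roughly as follows. This is an illustrative sketch only, not the authors' benchmark code: the function names `sample_training_subset` and `time_inference`, the uniform random sampling, the fixed seed, and the use of wall-clock time as the cost measure are all assumptions made for the example.

```python
import random
import time
from typing import Callable, Dict, List, Sequence


def sample_training_subset(video_ids: Sequence[str], fraction: float, seed: int = 0) -> List[str]:
    """Return a reproducible random subset holding `fraction` of the training videos.

    Hypothetical helper: uniform sampling with a fixed seed is an assumption,
    not necessarily the sampling scheme used in the paper.
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(seed)
    k = max(1, round(len(video_ids) * fraction))
    return sorted(rng.sample(list(video_ids), k))


def time_inference(
    model: Callable[[List[float]], object],
    lengths: Sequence[int],
    repeats: int = 3,
) -> Dict[int, float]:
    """Map each input length to the best wall-clock time (seconds) over `repeats` runs.

    Hypothetical helper: `model` is any callable taking a list of floats that
    stands in for a video's feature sequence of the given length.
    """
    timings: Dict[int, float] = {}
    for n in lengths:
        dummy_input = [0.0] * n  # placeholder for a video of length n
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            model(dummy_input)
            best = min(best, time.perf_counter() - start)
        timings[n] = best
    return timings


if __name__ == "__main__":
    ids = [f"video_{i:03d}" for i in range(200)]
    for frac in (0.1, 0.25, 0.5, 1.0):
        print(frac, len(sample_training_subset(ids, frac)))
    # `sum` is a stand-in for a real model's forward pass.
    print(time_inference(sum, lengths=[1_000, 10_000, 100_000]))
```

In a real benchmark the stand-in callable would be replaced by an actual model's forward pass, and wall-clock time would likely be complemented by other cost measures such as memory use.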

* Accepted to the CVEU workshop at ICCV 2023
