Title: Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

Jul 30, 2019

Sebastian Agethen, Winston H. Hsu

Abstract: Action recognition greatly benefits motion understanding in video analysis. Recurrent networks such as long short-term memory (LSTM) networks are a popular choice for motion-aware sequence learning tasks. Recently, a convolutional extension of LSTM was proposed, in which input-to-hidden and hidden-to-hidden transitions are modeled through convolution with a single kernel. This implies an unavoidable trade-off between effectiveness and efficiency. Herein, we propose a new enhancement to convolutional LSTM networks that supports accommodation of multiple convolutional kernels and layers. This resembles a Network-in-LSTM approach, which improves upon the aforementioned concern. In addition, we propose an attention-based mechanism that is specifically designed for our multi-kernel extension. We evaluated our proposed extensions in a supervised classification setting on the UCF-101 and Sports-1M datasets, with the findings showing that our enhancements improve accuracy. We also undertook qualitative analysis to reveal the characteristics of our system and the convolutional LSTM baseline.
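
For readers unfamiliar with the baseline the abstract refers to, the following is a sketch of the commonly used convolutional LSTM update, with peephole terms omitted for brevity. Here $*$ denotes convolution, $\odot$ the element-wise product, $X_t$ the input frame features, and $H_t$, $C_t$ the hidden and cell states. The symbols are conventional notation, not taken from this page. The last line is only an illustrative assumption of what a multi-kernel input-to-hidden transition might look like: the abstract does not say how the outputs of the different kernels are merged, so the summation over a hypothetical set of kernel sizes $\mathcal{K}$ should not be read as the authors' formulation.

```latex
% Single-kernel convolutional LSTM (peephole terms omitted)
\begin{align}
  i_t &= \sigma\left(W_{xi} * X_t + W_{hi} * H_{t-1} + b_i\right) \\
  f_t &= \sigma\left(W_{xf} * X_t + W_{hf} * H_{t-1} + b_f\right) \\
  o_t &= \sigma\left(W_{xo} * X_t + W_{ho} * H_{t-1} + b_o\right) \\
  C_t &= f_t \odot C_{t-1} + i_t \odot \tanh\left(W_{xc} * X_t + W_{hc} * H_{t-1} + b_c\right) \\
  H_t &= o_t \odot \tanh\left(C_t\right)
\end{align}

% Illustrative assumption only: one way to use several kernel sizes k in a
% hypothetical set \mathcal{K} for the input-to-hidden term of the input gate
\begin{equation}
  W_{xi} * X_t \;\longrightarrow\; \sum_{k \in \mathcal{K}} W_{xi}^{(k)} * X_t
\end{equation}
```

In the single-kernel form, each transition uses one kernel size, so a larger kernel captures faster motion at higher cost while a smaller one is cheaper but sees less context, which is the trade-off the abstract mentions.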
