Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Representing Videos based on Scene Layouts for Recognizing Agent-in-Place Actions

Apr 04, 2018

Ruichi Yu, Hongcheng Wang, Ang Li, Jingxiao Zheng, Vlad I. Morariu, Larry S. Davis

Figure 1 for Representing Videos based on Scene Layouts for Recognizing Agent-in-Place Actions

Figure 2 for Representing Videos based on Scene Layouts for Recognizing Agent-in-Place Actions

Figure 3 for Representing Videos based on Scene Layouts for Recognizing Agent-in-Place Actions

Figure 4 for Representing Videos based on Scene Layouts for Recognizing Agent-in-Place Actions

Share this with someone who'll enjoy it:

Abstract:We address the recognition of agent-in-place actions, which are associated with agents who perform them and places where they occur, in the context of outdoor home surveillance. We introduce a representation of the geometry and topology of scene layouts so that a network can generalize from the layouts observed in the training set to unseen layouts in the test set. This Layout-Induced Video Representation (LIVR) abstracts away low-level appearance variance and encodes geometric and topological relationships of places in a specific scene layout. LIVR partitions the semantic features of a video clip into different places to force the network to learn place-based feature descriptions; to predict the confidence of each action, LIVR aggregates features from the place associated with an action and its adjacent places on the scene layout. We introduce the Agent-in-Place Action dataset to show that our method allows neural network models to generalize significantly better to unseen scenes.

View paper on

Share this with someone who'll enjoy it:

Title:Representing Videos based on Scene Layouts for Recognizing Agent-in-Place Actions

Paper and Code