Title: Describing Common Human Visual Actions in Images

Jun 07, 2015

Matteo Ruggero Ronchi, Pietro Perona

Abstract: Which common human actions and interactions are recognizable in monocular still images? Which involve objects and/or other people? How many actions is a person performing at a time? We address these questions by exploring the actions and interactions that are detectable in the images of the MS COCO dataset. We make two main contributions. First, a list of 140 common 'visual actions', obtained by analyzing the largest on-line verb lexicon currently available for English (VerbNet) and human sentences used to describe images in MS COCO. Second, a complete set of annotations for those 'visual actions', composed of subject-object and associated verb, which we call COCO-a (a for 'actions'). COCO-a is larger than existing action datasets in terms of number of actions and instances of these actions, and is unique because it is data-driven, rather than experimenter-biased. Other unique features are that it is exhaustive, and that all subjects and objects are localized. A statistical analysis of the accuracy of our annotations and of each action, interaction and subject-object combination is provided.
