Title: Improving Human-Object Interaction Detection via Virtual Image Learning

Aug 04, 2023

Shuman Fang, Shuai Liu, Jie Li, Guannan Jiang, Xianming Lin, Rongrong Ji

Abstract: Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects, and it plays a crucial role in high-level semantic understanding tasks. However, most works focus on designing better architectures to learn overall features more efficiently, while ignoring the long-tail nature of interaction-object pair categories. In this paper, we propose to alleviate the impact of such an unbalanced distribution via Virtual Image Learning (VIL). First, a novel label-to-image approach, Multiple Steps Image Creation (MUSIC), is proposed to create a high-quality dataset whose distribution is consistent with that of real images. In this stage, virtual images are generated from prompts with specific characterizations and selected by multi-filtering processes. Second, we use both virtual and real images to train the model within a teacher-student framework. Because the initial labels of some virtual images are inaccurate or inadequate, we devise an Adaptive Matching-and-Filtering (AMF) module to construct pseudo-labels. Our method is independent of the internal structure of HOI detectors, so it can be combined with off-the-shelf methods by training for merely 10 additional epochs. With the assistance of our method, multiple methods obtain significant improvements, and new state-of-the-art results are achieved on two benchmarks.

* Accepted by ACM MM 2023
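
The abstract does not spell out how the AMF module builds pseudo-labels, so the following is only a rough illustration of the general idea of matching teacher predictions against a virtual image's initial labels and filtering by confidence. Everything in it is an assumption made for illustration: the `HOIPrediction` type, the `build_pseudo_labels` function, and both threshold values are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HOIPrediction:
    """Hypothetical container for one teacher prediction on a virtual image."""
    verb: str
    obj: str
    score: float


def build_pseudo_labels(initial_labels, teacher_predictions,
                        matched_threshold=0.2, unmatched_threshold=0.5):
    """Illustrative matching-and-filtering step (NOT the paper's AMF module).

    initial_labels: set of (verb, obj) pairs the image was generated from.
    teacher_predictions: list of HOIPrediction from the teacher model.
    Predictions that agree with an initial label only need to clear a lower
    score threshold; all other predictions must clear a higher one.
    Both thresholds are assumed values chosen for this sketch.
    """
    pseudo_labels = []
    for pred in teacher_predictions:
        matched = (pred.verb, pred.obj) in initial_labels
        threshold = matched_threshold if matched else unmatched_threshold
        if pred.score >= threshold:
            pseudo_labels.append(pred)
    return pseudo_labels


# Example: an image generated from the prompt label ("ride", "bicycle").
initial = {("ride", "bicycle")}
predictions = [
    HOIPrediction("ride", "bicycle", 0.3),  # matches the initial label
    HOIPrediction("hold", "bicycle", 0.3),  # unmatched, low confidence
    HOIPrediction("wear", "helmet", 0.7),   # unmatched, high confidence
]
kept = build_pseudo_labels(initial, predictions)
```

In this sketch the low-confidence unmatched prediction is dropped, while the confident extra prediction is kept as an additional label, which is one simple way to address initial labels that are both inaccurate and incomplete.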
