Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation

Dec 07, 2020

Bingyu Liu, Yuhong Guo, Jieping Ye, Weihong Deng

Figure 1 for Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation

Figure 2 for Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation

Figure 3 for Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation

Figure 4 for Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation

Share this with someone who'll enjoy it:

Abstract:Recent domain adaptation methods have demonstrated impressive improvement on unsupervised domain adaptation problems. However, in the semi-supervised domain adaptation (SSDA) setting where the target domain has a few labeled instances available, these methods can fail to improve performance. Inspired by the effectiveness of pseudo-labels in domain adaptation, we propose a reinforcement learning based selective pseudo-labeling method for semi-supervised domain adaptation. It is difficult for conventional pseudo-labeling methods to balance the correctness and representativeness of pseudo-labeled data. To address this limitation, we develop a deep Q-learning model to select both accurate and representative pseudo-labeled instances. Moreover, motivated by large margin loss's capacity on learning discriminative features with little data, we further propose a novel target margin loss for our base model training to improve its discriminability. Our proposed method is evaluated on several benchmark datasets for SSDA, and demonstrates superior performance to all the comparison methods.

View paper on

Share this with someone who'll enjoy it:

Title:Selective Pseudo-Labeling with Reinforcement Learning for Semi-Supervised Domain Adaptation

Paper and Code