Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:SemiReward: A General Reward Model for Semi-supervised Learning

Oct 04, 2023

Siyuan Li, Weiyang Jin, Zedong Wang, Fang Wu, Zicheng Liu, Cheng Tan, Stan Z. Li

Figure 1 for SemiReward: A General Reward Model for Semi-supervised Learning

Figure 2 for SemiReward: A General Reward Model for Semi-supervised Learning

Figure 3 for SemiReward: A General Reward Model for Semi-supervised Learning

Figure 4 for SemiReward: A General Reward Model for Semi-supervised Learning

Share this with someone who'll enjoy it:

Abstract:Semi-supervised learning (SSL) has witnessed great progress with various improvements in the self-training framework with pseudo labeling. The main challenge is how to distinguish high-quality pseudo labels against the confirmation bias. However, existing pseudo-label selection strategies are limited to pre-defined schemes or complex hand-crafted policies specially designed for classification, failing to achieve high-quality labels, fast convergence, and task versatility simultaneously. To these ends, we propose a Semi-supervised Reward framework (SemiReward) that predicts reward scores to evaluate and filter out high-quality pseudo labels, which is pluggable to mainstream SSL methods in wide task types and scenarios. To mitigate confirmation bias, SemiReward is trained online in two stages with a generator model and subsampling strategy. With classification and regression tasks on 13 standard SSL benchmarks of three modalities, extensive experiments verify that SemiReward achieves significant performance gains and faster convergence speeds upon Pseudo Label, FlexMatch, and Free/SoftMatch.

* Preprint of 22 pages with the source code at \url{https://github.com/Westlake-AI/SemiReward}

View paper on

Share this with someone who'll enjoy it:

Title:SemiReward: A General Reward Model for Semi-supervised Learning

Paper and Code