Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Oct 16, 2023

Jianhao Yuan, Jie Zhang, Shuyang Sun, Philip Torr, Bo Zhao

Figure 1 for Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Figure 2 for Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Figure 3 for Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Figure 4 for Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Share this with someone who'll enjoy it:

Abstract:Synthetic training data has gained prominence in numerous learning tasks and scenarios, offering advantages such as dataset augmentation, generalization evaluation, and privacy preservation. Despite these benefits, the efficiency of synthetic data generated by current methodologies remains inferior when training advanced deep models exclusively, limiting its practical utility. To address this challenge, we analyze the principles underlying training data synthesis for supervised learning and elucidate a principled theoretical framework from the distribution-matching perspective that explicates the mechanisms governing synthesis efficacy. Through extensive experiments, we demonstrate the effectiveness of our synthetic data across diverse image classification tasks, both as a replacement for and augmentation to real datasets, while also benefits challenging tasks such as out-of-distribution generalization and privacy preservation.

* Code released at (https://github.com/BAAI-DCAI/Training-Data-Synthesis)

View paper on

Share this with someone who'll enjoy it:

Title:Real-Fake: Effective Training Data Synthesis Through Distribution Matching

Paper and Code