Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Martin Bresler

Data Incubation -- Synthesizing Missing Data for Handwriting Recognition

Oct 13, 2021

Jen-Hao Rick Chang, Martin Bresler, Youssouf Chherawala, Adrien Delaye, Thomas Deselaers, Ryan Dixon, Oncel Tuzel

Figure 1 for Data Incubation -- Synthesizing Missing Data for Handwriting Recognition

Figure 2 for Data Incubation -- Synthesizing Missing Data for Handwriting Recognition

Figure 3 for Data Incubation -- Synthesizing Missing Data for Handwriting Recognition

Abstract:In this paper, we demonstrate how a generative model can be used to build a better recognizer through the control of content and style. We are building an online handwriting recognizer from a modest amount of training samples. By training our controllable handwriting synthesizer on the same data, we can synthesize handwriting with previously underrepresented content (e.g., URLs and email addresses) and style (e.g., cursive and slanted). Moreover, we propose a framework to analyze a recognizer that is trained with a mixture of real and synthetic training data. We use the framework to optimize data synthesis and demonstrate significant improvement on handwriting recognition over a model trained on real data only. Overall, we achieve a 66% reduction in Character Error Rate.

Via

Access Paper or Ask Questions