Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Linghui Wu

Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task

Oct 08, 2022

Cong Ma, Yaping Zhang, Mei Tu, Xu Han, Linghui Wu, Yang Zhao, Yu Zhou

Figure 1 for Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task

Figure 2 for Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task

Figure 3 for Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task

Figure 4 for Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task

Abstract:End-to-end text image translation (TIT), which aims at translating the source language embedded in images to the target language, has attracted intensive attention in recent research. However, data sparsity limits the performance of end-to-end text image translation. Multi-task learning is a non-trivial way to alleviate this problem via exploring knowledge from complementary related tasks. In this paper, we propose a novel text translation enhanced text image translation, which trains the end-to-end model with text translation as an auxiliary task. By sharing model parameters and multi-task training, our model is able to take full advantage of easily-available large-scale text parallel corpus. Extensive experimental results show our proposed method outperforms existing end-to-end methods, and the joint multi-task learning with both text translation and recognition tasks achieves better results, proving translation and recognition auxiliary tasks are complementary.

* Accepted at the 26TH International Conference on Pattern Recognition (ICPR 2022)

Via

Access Paper or Ask Questions