Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Jun 14, 2022

Ziqiang Zhang, Junyi Ao, Long Zhou, Shujie Liu, Furu Wei, Jinyu Li

Figure 1 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Figure 2 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Figure 3 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Figure 4 for The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Share this with someone who'll enjoy it:

Abstract:This paper describes the submission of our end-to-end YiTrans speech translation system for the IWSLT 2022 offline task, which translates from English audio to German, Chinese, and Japanese. The YiTrans system is built on large-scale pre-trained encoder-decoder models. More specifically, we first design a multi-stage pre-training strategy to build a multi-modality model with a large amount of labeled and unlabeled data. We then fine-tune the corresponding components of the model for the downstream speech translation tasks. Moreover, we make various efforts to improve performance, such as data filtering, data augmentation, speech segmentation, model ensemble, and so on. Experimental results show that our YiTrans system obtains a significant improvement than the strong baseline on three translation directions, and it achieves +5.2 BLEU improvements over last year's optimal end-to-end system on tst2021 English-German. Our final submissions rank first on English-German and English-Chinese end-to-end systems in terms of the automatic evaluation metric. We make our code and models publicly available.

* 11 pages

View paper on

Share this with someone who'll enjoy it:

Title:The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task

Paper and Code