Abstract: We present BERTGEN, a novel generative, decoder-only model that extends BERT by fusing the multimodal and multilingual pre-trained models VL-BERT and M-BERT, respectively. BERTGEN is trained auto-regressively for language generation tasks, namely image captioning, machine translation and multimodal machine translation, under a multi-task setting. With a comprehensive set of evaluations, we show that BERTGEN outperforms many strong baselines across the tasks explored. We also show BERTGEN's ability to perform zero-shot language generation, where it exhibits competitive performance against supervised counterparts. Finally, we conduct ablation studies which demonstrate that BERTGEN substantially benefits from multi-tasking and effectively transfers relevant inductive biases from the pre-trained models.