Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Adapting Large Language Models for Document-Level Machine Translation

Jan 12, 2024

Minghao Wu, Thuy-Trang Vu, Lizhen Qu, George Foster, Gholamreza Haffari

Figure 1 for Adapting Large Language Models for Document-Level Machine Translation

Figure 2 for Adapting Large Language Models for Document-Level Machine Translation

Figure 3 for Adapting Large Language Models for Document-Level Machine Translation

Figure 4 for Adapting Large Language Models for Document-Level Machine Translation

Share this with someone who'll enjoy it:

Abstract:Large language models (LLMs) have made significant strides in various natural language processing (NLP) tasks. Recent research shows that the moderately-sized LLMs often outperform their larger counterparts after task-specific fine-tuning. In this work, we delve into the process of adapting LLMs to specialize in document-level machine translation (DocMT) for a specific language pair. Firstly, we explore how prompt strategies affect downstream translation performance. Then, we conduct extensive experiments with two fine-tuning methods, three LLM backbones, and 18 translation tasks across nine language pairs. Our findings indicate that in some cases, these specialized models even surpass GPT-4 in translation performance, while they still significantly suffer from the off-target translation issue in others, even if they are exclusively fine-tuned on bilingual parallel documents. Furthermore, we provide an in-depth analysis of these LLMs tailored for DocMT, exploring aspects such as translation errors, the scaling law of parallel documents, out-of-domain generalization, and the impact of zero-shot crosslingual transfer. The findings of this research not only shed light on the strengths and limitations of LLM-based DocMT models but also provide a foundation for future research in DocMT.

* work in progress; 21 pages, 14 tables, 7 figures

View paper on

Share this with someone who'll enjoy it:

Title:Adapting Large Language Models for Document-Level Machine Translation

Paper and Code