Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Dec 08, 2024

Aman Kassahun Wassie, Mahdi Molaei, Yasmin Moslem

Figure 1 for Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Figure 2 for Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Figure 3 for Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Figure 4 for Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Share this with someone who'll enjoy it:

Abstract:In this work, we compare the domain-specific translation performance of open-source autoregressive decoder-only large language models (LLMs) with task-oriented machine translation (MT) models. Our experiments focus on the medical domain and cover four language pairs with varied resource availability: English-to-French, English-to-Portuguese, English-to-Swahili, and Swahili-to-English. Despite recent advancements, LLMs exhibit a clear gap in specialized translation quality compared to multilingual encoder-decoder MT models such as NLLB-200. In three out of four language directions in our study, NLLB-200 3.3B outperforms all LLMs in the size range of 8B parameters in medical translation. While fine-tuning LLMs such as Mistral and Llama improves their performance at medical translation, these models still fall short compared to fine-tuned NLLB-200 3.3B models. Our findings highlight the ongoing need for specialized MT models to achieve higher-quality domain-specific translation, especially in medium-resource and low-resource settings. As larger LLMs outperform their 8B variants, this also encourages pre-training domain-specific medium-sized LMs to improve quality and efficiency in specialized translation tasks.

View paper on

Share this with someone who'll enjoy it:

Title:Domain-Specific Translation with Open-Source Large Language Models: Resource-Oriented Analysis

Paper and Code