Dragos Corlatescu

"Vorbeşti Româneşte?" A Recipe to Train Powerful Romanian LLMs with English Instructions

Jun 26, 2024

Mihai Masala, Denis C. Ilie-Ablachim, Alexandru Dima, Dragos Corlatescu, Miruna Zavelca, Ovio Olaru, Simina Terian-Dan, Andrei Terian-Dan, Marius Leordeanu, Horia Velicu (+3 more)

Abstract: In recent years, Large Language Models (LLMs) have achieved almost human-like performance on various tasks. While some LLMs have been trained on multilingual data, most of the training data is in English; hence, their performance in English greatly exceeds their performance in other languages. To our knowledge, we are the first to collect and translate a large collection of texts, instructions, and benchmarks and to train, evaluate, and release open-source LLMs tailored for Romanian. We evaluate our methods on four different categories, including academic benchmarks, MT-Bench (manually translated), and a professionally built historical, cultural, and social benchmark adapted to Romanian. We argue for the usefulness and high performance of RoLLMs by obtaining state-of-the-art results across the board. We publicly release all resources (i.e., data, training and evaluation code, models) to support and encourage research on Romanian LLMs while concurrently creating a generalizable recipe, adequate for other low or less-resourced languages.

* arXiv admin note: text overlap with arXiv:2405.07703

OpenLLM-Ro -- Technical Report on Open-source Romanian LLMs

May 17, 2024

Mihai Masala, Denis C. Ilie-Ablachim, Dragos Corlatescu, Miruna Zavelca, Marius Leordeanu, Horia Velicu, Marius Popescu, Mihai Dascalu, Traian Rebedea
