Huong Le Thanh

LaVy: Vietnamese Multimodal Large Language Model

Apr 13, 2024

Chi Tran, Huong Le Thanh

Abstract: Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) have taken the world by storm with impressive abilities in complex reasoning and linguistic comprehension. While there is a plethora of work on Vietnamese Large Language Models, the lack of high-quality multimodal resources limits the progress of Vietnamese MLLMs. In this paper, we pioneer in addressing this gap by introducing LaVy, a state-of-the-art Vietnamese MLLM, and we also introduce the LaVy-Bench benchmark, designed for evaluating MLLMs' understanding of Vietnamese visual language tasks. All code and model weights are public at https://github.com/baochi0212/LaVy

* 4 pages