Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xucheng Huang

Xmodel-LM Technical Report

Jun 05, 2024

Yichuan Wang, Yang Liu, Yu Yan, Xucheng Huang, Ling Jiang

Abstract:We introduce Xmodel-LM, a compact and efficient 1.1B language model pre-trained on over 2 trillion tokens. Trained on our self-built dataset (Xdata), which balances Chinese and English corpora based on downstream task optimization, Xmodel-LM exhibits remarkable performance despite its smaller size. It notably surpasses existing open-source language models of similar scale. Our model checkpoints and code are publicly accessible on GitHub at https://github.com/XiaoduoAILab/XmodelLM.

Via

Access Paper or Ask Questions

Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

May 15, 2024

Wanting Xu, Yang Liu, Langping He, Xucheng Huang, Ling Jiang

Figure 1 for Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

Figure 2 for Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

Figure 3 for Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

Figure 4 for Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model

Abstract:We introduce Xmodel-VLM, a cutting-edge multimodal vision language model. It is designed for efficient deployment on consumer GPU servers. Our work directly confronts a pivotal industry issue by grappling with the prohibitive service costs that hinder the broad adoption of large-scale multimodal systems. Through rigorous training, we have developed a 1B-scale language model from the ground up, employing the LLaVA paradigm for modal alignment. The result, which we call Xmodel-VLM, is a lightweight yet powerful multimodal vision language model. Extensive testing across numerous classic multimodal benchmarks has revealed that despite its smaller size and faster execution, Xmodel-VLM delivers performance comparable to that of larger models. Our model checkpoints and code are publicly available on GitHub at https://github.com/XiaoduoAILab/XmodelVLM.

Via

Access Paper or Ask Questions