Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zheyan Luo

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Mar 21, 2024

Yaowei Zheng, Richong Zhang, Junhao Zhang, Yanhan Ye, Zheyan Luo, Yongqiang Ma

Figure 1 for LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Figure 2 for LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Figure 3 for LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Figure 4 for LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Abstract:Efficient fine-tuning is vital for adapting large language models (LLMs) to downstream tasks. However, it requires non-trivial efforts to implement these methods on different models. We present LlamaFactory, a unified framework that integrates a suite of cutting-edge efficient training methods. It allows users to flexibly customize the fine-tuning of 100+ LLMs without the need for coding through the built-in web UI LlamaBoard. We empirically validate the efficiency and effectiveness of our framework on language modeling and text generation tasks. It has been released at https://github.com/hiyouga/LLaMA-Factory and already received over 13,000 stars and 1,600 forks.

* 12 pages, preprint

Via

Access Paper or Ask Questions

AdversarialWord Dilution as Text Data Augmentation in Low-Resource Regime

May 16, 2023

Junfan Chen, Richong Zhang, Zheyan Luo, Chunming Hu, Yongyi Mao

Figure 1 for AdversarialWord Dilution as Text Data Augmentation in Low-Resource Regime

Figure 2 for AdversarialWord Dilution as Text Data Augmentation in Low-Resource Regime

Figure 3 for AdversarialWord Dilution as Text Data Augmentation in Low-Resource Regime

Figure 4 for AdversarialWord Dilution as Text Data Augmentation in Low-Resource Regime

Abstract:Data augmentation is widely used in text classification, especially in the low-resource regime where a few examples for each class are available during training. Despite the success, generating data augmentations as hard positive examples that may increase their effectiveness is under-explored. This paper proposes an Adversarial Word Dilution (AWD) method that can generate hard positive examples as text data augmentations to train the low-resource text classification model efficiently. Our idea of augmenting the text data is to dilute the embedding of strong positive words by weighted mixing with unknown-word embedding, making the augmented inputs hard to be recognized as positive by the classification model. We adversarially learn the dilution weights through a constrained min-max optimization process with the guidance of the labels. Empirical studies on three benchmark datasets show that AWD can generate more effective data augmentations and outperform the state-of-the-art text data augmentation methods. The additional analysis demonstrates that the data augmentations generated by AWD are interpretable and can flexibly extend to new examples without further training.

* Preprint, Accepted by AAAI 2023

Via

Access Paper or Ask Questions