Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kota Tanabe

Enhancing Financial Domain Adaptation of Language Models via Model Augmentation

Nov 14, 2024

Kota Tanabe, Masanori Hirano, Kazuki Matoya, Kentaro Imajo, Hiroki Sakaji, Itsuki Noda

Figure 1 for Enhancing Financial Domain Adaptation of Language Models via Model Augmentation

Figure 2 for Enhancing Financial Domain Adaptation of Language Models via Model Augmentation

Figure 3 for Enhancing Financial Domain Adaptation of Language Models via Model Augmentation

Abstract:The domain adaptation of language models, including large language models (LLMs), has become increasingly important as the use of such models continues to expand. This study demonstrates the effectiveness of Composition to Augment Language Models (CALM) in adapting to the financial domain. CALM is a model to extend the capabilities of existing models by introducing cross-attention between two LLMs with different functions. In our experiments, we developed a CALM to enhance the financial performance of an LLM with strong response capabilities by leveraging a financial-specialized LLM. Notably, the CALM was trained using a financial dataset different from the one used to train the financial-specialized LLM, confirming CALM's ability to adapt to various datasets. The models were evaluated through quantitative Japanese financial benchmarks and qualitative response comparisons, demonstrating that CALM enables superior responses with higher scores than the original models and baselines. Additionally, comparative experiments on connection points revealed that connecting the middle layers of the models is most effective in facilitating adaptation to the financial domain. These findings confirm that CALM is a practical approach for adapting LLMs to the financial domain.

Via

Access Paper or Ask Questions

JaFIn: Japanese Financial Instruction Dataset

Apr 14, 2024

Kota Tanabe, Masahiro Suzuki, Hiroki Sakaji, Itsuki Noda

Figure 1 for JaFIn: Japanese Financial Instruction Dataset

Figure 2 for JaFIn: Japanese Financial Instruction Dataset

Figure 3 for JaFIn: Japanese Financial Instruction Dataset

Figure 4 for JaFIn: Japanese Financial Instruction Dataset

Abstract:We construct an instruction dataset for the large language model (LLM) in the Japanese finance domain. Domain adaptation of language models, including LLMs, is receiving more attention as language models become more popular. This study demonstrates the effectiveness of domain adaptation through instruction tuning. To achieve this, we propose an instruction tuning data in Japanese called JaFIn, the Japanese Financial Instruction Dataset. JaFIn is manually constructed based on multiple data sources, including Japanese government websites, which provide extensive financial knowledge. We then utilize JaFIn to apply instruction tuning for several LLMs, demonstrating that our models specialized in finance have better domain adaptability than the original models. The financial-specialized LLMs created were evaluated using a quantitative Japanese financial benchmark and qualitative response comparisons, showing improved performance over the originals.

* 10 pages, 1 figure

Via

Access Paper or Ask Questions