Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tomohito Kasahara

Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Oct 17, 2023

Tomohito Kasahara, Daisuke Kawahara

Figure 1 for Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Figure 2 for Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Figure 3 for Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Figure 4 for Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation

Abstract:Automatic evaluation of text generation is essential for improving the accuracy of generation tasks. In light of the current trend towards increasingly larger decoder-based language models, we investigate automatic evaluation methods based on such models for text generation. This paper compares various methods, including tuning with encoder-based models and large language models under equal conditions, on two different tasks, machine translation evaluation and semantic textual similarity, in two languages, Japanese and English. Experimental results show that compared to the tuned encoder-based models, the tuned decoder-based models perform poorly. The analysis of the causes for this suggests that the decoder-based models focus on surface word sequences and do not capture meaning. It is also revealed that in-context learning of very large decoder-based models such as ChatGPT makes it difficult to identify fine-grained semantic differences.

* Accepted to IJCNLP-AACL 2023 SRW

Via

Access Paper or Ask Questions

Building a Personalized Dialogue System with Prompt-Tuning

Jun 11, 2022

Tomohito Kasahara, Daisuke Kawahara, Nguyen Tung, Shengzhe Li, Kenta Shinzato, Toshinori Sato

Figure 1 for Building a Personalized Dialogue System with Prompt-Tuning

Figure 2 for Building a Personalized Dialogue System with Prompt-Tuning

Figure 3 for Building a Personalized Dialogue System with Prompt-Tuning

Figure 4 for Building a Personalized Dialogue System with Prompt-Tuning

Abstract:Dialogue systems without consistent responses are not fascinating. In this study, we build a dialogue system that can respond based on a given character setting (persona) to bring consistency. Considering the trend of the rapidly increasing scale of language models, we propose an approach that uses prompt-tuning, which has low learning costs, on pre-trained large-scale language models. The results of automatic and manual evaluations in English and Japanese show that it is possible to build a dialogue system with more natural and personalized responses using less computational resources than fine-tuning.

* Accepted to NAACL 2022 SRW

Via

Access Paper or Ask Questions