Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Suhyune Son

Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Jun 16, 2024

Yoonna Jang, Suhyune Son, Jeongwoo Lee, Junyoung Son, Yuna Hur, Jungwoo Lim, Hyeonseok Moon, Kisu Yang, Heuiseok Lim

Figure 1 for Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Figure 2 for Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Figure 3 for Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Figure 4 for Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations

Abstract:Despite the striking advances in recent language generation performance, model-generated responses have suffered from the chronic problem of hallucinations that are either untrue or unfaithful to a given source. Especially in the task of knowledge grounded conversation, the models are required to generate informative responses, but hallucinated utterances lead to miscommunication. In particular, entity-level hallucination that causes critical misinformation and undesirable conversation is one of the major concerns. To address this issue, we propose a post-hoc refinement method called REM. It aims to enhance the quality and faithfulness of hallucinated utterances by refining them based on the source knowledge. If the generated utterance has a low source-faithfulness score with the given knowledge, REM mines the key entities in the knowledge and implicitly uses them for refining the utterances. We verify that our method reduces entity hallucination in the utterance. Also, we show the adaptability and efficacy of REM with extensive experiments and generative results. Our code is available at https://github.com/YOONNAJANG/REM.

* Accepted at EMNLP 2023

Via

Access Paper or Ask Questions

Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models

Sep 14, 2022

Suhyune Son, Chanjun Park, Jungseob Lee, Midan Shim, Chanhee Lee, Yoonna Jang, Jaehyung Seo, Heuiseok Lim

Figure 1 for Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models

Figure 2 for Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models

Figure 3 for Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models

Figure 4 for Language Chameleon: Transformation analysis between languages using Cross-lingual Post-training based on Pre-trained language models

Abstract:As pre-trained language models become more resource-demanding, the inequality between resource-rich languages such as English and resource-scarce languages is worsening. This can be attributed to the fact that the amount of available training data in each language follows the power-law distribution, and most of the languages belong to the long tail of the distribution. Some research areas attempt to mitigate this problem. For example, in cross-lingual transfer learning and multilingual training, the goal is to benefit long-tail languages via the knowledge acquired from resource-rich languages. Although being successful, existing work has mainly focused on experimenting on as many languages as possible. As a result, targeted in-depth analysis is mostly absent. In this study, we focus on a single low-resource language and perform extensive evaluation and probing experiments using cross-lingual post-training (XPT). To make the transfer scenario challenging, we choose Korean as the target language, as it is a language isolate and thus shares almost no typology with English. Results show that XPT not only outperforms or performs on par with monolingual models trained with orders of magnitudes more data but also is highly efficient in the transfer process.

Via

Access Paper or Ask Questions

Empirical study on BlenderBot 2.0 Errors Analysis in terms of Model, Data and User-Centric Approach

Jan 10, 2022

Jungseob Lee, Midan Shim, Suhyune Son, Yujin Kim, Chanjun Park, Heuiseok Lim

Figure 1 for Empirical study on BlenderBot 2.0 Errors Analysis in terms of Model, Data and User-Centric Approach

Figure 2 for Empirical study on BlenderBot 2.0 Errors Analysis in terms of Model, Data and User-Centric Approach

Figure 3 for Empirical study on BlenderBot 2.0 Errors Analysis in terms of Model, Data and User-Centric Approach

Figure 4 for Empirical study on BlenderBot 2.0 Errors Analysis in terms of Model, Data and User-Centric Approach

Abstract:BlenderBot 2.0 is a dialogue model that represents open-domain chatbots by reflecting real-time information and remembering user information for an extended period using an internet search module and multi-session. Nonetheless, the model still has room for improvement. To this end, we examined BlenderBot 2.0 limitations and errors from three perspectives: model, data, and user. From the data point of view, we highlight the unclear guidelines provided to workers during the crowdsourcing process, as well as a lack of a process for refining hate speech in the collected data and verifying the accuracy of internet-based information. From a user perspective, we identify nine types of problems of BlenderBot 2.0, and their causes are thoroughly investigated. Furthermore, for each point of view, practical improvement methods are proposed, and we discuss several potential future research directions.

* English Version of "Empirical study on BlenderBot 2.0 errors analysis in terms of model, data and dialogue" (Journal of the Korea Convergence Society)

Via

Access Paper or Ask Questions

Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

Dec 16, 2021

Yoonna Jang, Jungwoo Lim, Yuna Hur, Dongsuk Oh, Suhyune Son, Yeonsoo Lee, Donghoon Shin, Seungryong Kim, Heuiseok Lim

Figure 1 for Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

Figure 2 for Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

Figure 3 for Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

Figure 4 for Call for Customized Conversation: Customized Conversation Grounding Persona and Knowledge

Abstract:Humans usually have conversations by making use of prior knowledge about a topic and background information of the people whom they are talking to. However, existing conversational agents and datasets do not consider such comprehensive information, and thus they have a limitation in generating the utterances where the knowledge and persona are fused properly. To address this issue, we introduce a call For Customized conversation (FoCus) dataset where the customized answers are built with the user's persona and Wikipedia knowledge. To evaluate the abilities to make informative and customized utterances of pre-trained language models, we utilize BART and GPT-2 as well as transformer-based models. We assess their generation abilities with automatic scores and conduct human evaluations for qualitative results. We examine whether the model reflects adequate persona and knowledge with our proposed two sub-tasks, persona grounding (PG) and knowledge grounding (KG). Moreover, we show that the utterances of our data are constructed with the proper knowledge and persona through grounding quality assessment.

* Accepted paper at the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-22)

Via

Access Paper or Ask Questions