Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ayana Niwa

Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning

Feb 28, 2025

Ayana Niwa, Masahiro Kaneko, Kentaro Inui

Abstract:Large language models (LLMs) can exhibit advanced reasoning yet still generate incorrect answers. We hypothesize that such errors frequently stem from spurious beliefs, propositions the model internally considers true but are incorrect. To address this, we propose a method to rectify the belief space by suppressing these spurious beliefs while simultaneously enhancing true ones, thereby enabling more reliable inferences. Our approach first identifies the beliefs that lead to incorrect or correct answers by prompting the model to generate textual explanations, using our Forward-Backward Beam Search (FBBS). We then apply unlearning to suppress the identified spurious beliefs and enhance the true ones, effectively rectifying the model's belief space. Empirical results on multiple QA datasets and LLMs show that our method corrects previously misanswered questions without harming overall model performance. Furthermore, our approach yields improved generalization on unseen data, suggesting that rectifying a model's belief space is a promising direction for mitigating errors and enhancing overall reliability.

Via

Access Paper or Ask Questions

AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Feb 27, 2024

Ayana Niwa, Hayate Iso

Figure 1 for AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Figure 2 for AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Figure 3 for AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Figure 4 for AmbigNLG: Addressing Task Ambiguity in Instruction for NLG

Abstract:In this study, we introduce AmbigNLG, a new task designed to tackle the challenge of task ambiguity in instructions for Natural Language Generation (NLG) tasks. Despite the impressive capabilities of Large Language Models (LLMs) in understanding and executing a wide range of tasks through natural language interaction, their performance is significantly hindered by the ambiguity present in real-world instructions. To address this, AmbigNLG seeks to identify and mitigate such ambiguities, aiming to refine instructions to match user expectations better. We introduce a dataset, AmbigSNI-NLG, consisting of 2,500 instances, and develop an ambiguity taxonomy for categorizing and annotating instruction ambiguities. Our approach demonstrates substantial improvements in text generation quality, highlighting the critical role of clear and specific instructions in enhancing LLM performance in NLG tasks.

* work in progress

Via

Access Paper or Ask Questions

Nearest Neighbor Non-autoregressive Text Generation

Aug 26, 2022

Ayana Niwa, Sho Takase, Naoaki Okazaki

Figure 1 for Nearest Neighbor Non-autoregressive Text Generation

Figure 2 for Nearest Neighbor Non-autoregressive Text Generation

Figure 3 for Nearest Neighbor Non-autoregressive Text Generation

Figure 4 for Nearest Neighbor Non-autoregressive Text Generation

Abstract:Non-autoregressive (NAR) models can generate sentences with less computation than autoregressive models but sacrifice generation quality. Previous studies addressed this issue through iterative decoding. This study proposes using nearest neighbors as the initial state of an NAR decoder and editing them iteratively. We present a novel training strategy to learn the edit operations on neighbors to improve NAR text generation. Experimental results show that the proposed method (NeighborEdit) achieves higher translation quality (1.69 points higher than the vanilla Transformer) with fewer decoding iterations (one-eighteenth fewer iterations) on the JRC-Acquis En-De dataset, the common benchmark dataset for machine translation using nearest neighbors. We also confirm the effectiveness of the proposed method on a data-to-text task (WikiBio). In addition, the proposed method outperforms an NAR baseline on the WMT'14 En-De dataset. We also report analysis on neighbor examples used in the proposed method.

Via

Access Paper or Ask Questions

Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Mar 14, 2022

Masahiro Kaneko, Sho Takase, Ayana Niwa, Naoaki Okazaki

Figure 1 for Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Figure 2 for Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Figure 3 for Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Figure 4 for Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Abstract:Grammatical Error Correction (GEC) should not focus only on high accuracy of corrections but also on interpretability for language learning. However, existing neural-based GEC models mainly aim at improving accuracy, and their interpretability has not been explored. A promising approach for improving interpretability is an example-based method, which uses similar retrieved examples to generate corrections. In addition, examples are beneficial in language learning, helping learners understand the basis of grammatically incorrect/correct texts and improve their confidence in writing. Therefore, we hypothesize that incorporating an example-based method into GEC can improve interpretability as well as support language learners. In this study, we introduce an Example-Based GEC (EB-GEC) that presents examples to language learners as a basis for a correction result. The examples consist of pairs of correct and incorrect sentences similar to a given input and its predicted correction. Experiments demonstrate that the examples presented by EB-GEC help language learners decide to accept or refuse suggestions from the GEC output. Furthermore, the experiments also show that retrieved examples improve the accuracy of corrections.

* ACL 2022

Via

Access Paper or Ask Questions