Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

Apr 01, 2024

Yijun Yang, Jie He, Pinzhen Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan

Figure 1 for UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

Figure 2 for UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

Figure 3 for UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

Figure 4 for UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

Share this with someone who'll enjoy it:

Abstract:Several recent papers have investigated the potential of language models as knowledge bases as well as the existence of severe biases when extracting factual knowledge. In this work, we focus on the factual probing performance over unseen prompts from tuning, and using a probabilistic view we show the inherent misalignment between pre-training and downstream tuning objectives in language models for probing knowledge. We hypothesize that simultaneously debiasing these objectives can be the key to generalisation over unseen prompts. We propose an adapter-based framework, UniArk, for generalised and consistent factual knowledge extraction through simple methods without introducing extra parameters. Extensive experiments show that UniArk can significantly improve the model's out-of-domain generalisation as well as consistency under various prompts. Additionally, we construct ParaTrex, a large-scale and diverse dataset for measuring the inconsistency and out-of-domain generation of models. Further, ParaTrex offers a reference method for constructing paraphrased datasets using large language models.

* NAACL 2024

View paper on

Share this with someone who'll enjoy it:

Title:UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

Paper and Code