Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kowsik Nandagopan D

Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Feb 19, 2024

Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

Figure 1 for Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Figure 2 for Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Figure 3 for Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Figure 4 for Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

Abstract:Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited. This hinders their application in real-world scenarios where understanding the sequential nature of events is crucial. This paper experiments with state-of-the-art models on a novel, large-scale temporal dataset, \textbf{TempUN}, to reveal significant limitations in temporal retention and reasoning abilities. Interestingly, closed-source models indicate knowledge gaps more frequently, potentially suggesting a trade-off between uncertainty awareness and incorrect responses. Further, exploring various fine-tuning approaches yielded no major performance improvements. The associated dataset and code are available at the following URL (https://github.com/lingoiitgn/TempUN).

Via

Access Paper or Ask Questions

Cross-lingual Editing in Multilingual Language Models

Feb 03, 2024

Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

Figure 1 for Cross-lingual Editing in Multilingual Language Models

Figure 2 for Cross-lingual Editing in Multilingual Language Models

Figure 3 for Cross-lingual Editing in Multilingual Language Models

Figure 4 for Cross-lingual Editing in Multilingual Language Models

Abstract:The training of large language models (LLMs) necessitates substantial data and computational resources, and updating outdated LLMs entails significant efforts and resources. While numerous model editing techniques (METs) have emerged to efficiently update model outputs without retraining, their effectiveness in multilingual LLMs, where knowledge is stored in diverse languages, remains an underexplored research area. This research paper introduces the cross-lingual model editing (\textbf{XME}) paradigm, wherein a fact is edited in one language, and the subsequent update propagation is observed across other languages. To investigate the XME paradigm, we conducted experiments using BLOOM, mBERT, and XLM-RoBERTa using the two writing scripts: \textit{Latin} (English, French, and Spanish) and \textit{Indic} (Hindi, Gujarati, and Bengali). The results reveal notable performance limitations of state-of-the-art METs under the XME setting, mainly when the languages involved belong to two distinct script families. These findings highlight the need for further research and development of XME techniques to address these challenges. For more comprehensive information, the dataset used in this research and the associated code are publicly available at the following URL\url{https://github.com/lingo-iitgn/XME}.

* Accepted at EACL 2024

Via

Access Paper or Ask Questions