Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Udhay Nallasamy

Do LLMs "know" internally when they follow instructions?

Oct 22, 2024

Juyeon Heo, Christina Heinze-Deml, Oussama Elachqar, Shirley Ren, Udhay Nallasamy, Andy Miller, Kwan Ho Ryan Chan, Jaya Narain

Figure 1 for Do LLMs "know" internally when they follow instructions?

Figure 2 for Do LLMs "know" internally when they follow instructions?

Figure 3 for Do LLMs "know" internally when they follow instructions?

Figure 4 for Do LLMs "know" internally when they follow instructions?

Abstract:Instruction-following is crucial for building AI agents with large language models (LLMs), as these models must adhere strictly to user-provided constraints and guidelines. However, LLMs often fail to follow even simple and clear instructions. To improve instruction-following behavior and prevent undesirable outputs, a deeper understanding of how LLMs' internal states relate to these outcomes is required. Our analysis of LLM internal states reveal a dimension in the input embedding space linked to successful instruction-following. We demonstrate that modifying representations along this dimension improves instruction-following success rates compared to random changes, without compromising response quality. Further investigation reveals that this dimension is more closely related to the phrasing of prompts rather than the inherent difficulty of the task or instructions. This discovery also suggests explanations for why LLMs sometimes fail to follow clear instructions and why prompt engineering is often effective, even when the content remains largely unchanged. This work provides insight into the internal workings of LLMs' instruction-following, paving the way for reliable LLM agents.

Via

Access Paper or Ask Questions

Variational Neural Machine Translation with Normalizing Flows

May 28, 2020

Hendra Setiawan, Matthias Sperber, Udhay Nallasamy, Matthias Paulik

Figure 1 for Variational Neural Machine Translation with Normalizing Flows

Figure 2 for Variational Neural Machine Translation with Normalizing Flows

Figure 3 for Variational Neural Machine Translation with Normalizing Flows

Figure 4 for Variational Neural Machine Translation with Normalizing Flows

Abstract:Variational Neural Machine Translation (VNMT) is an attractive framework for modeling the generation of target translations, conditioned not only on the source sentence but also on some latent random variables. The latent variable modeling may introduce useful statistical dependencies that can improve translation accuracy. Unfortunately, learning informative latent variables is non-trivial, as the latent space can be prohibitively large, and the latent codes are prone to be ignored by many translation models at training time. Previous works impose strong assumptions on the distribution of the latent code and limit the choice of the NMT architecture. In this paper, we propose to apply the VNMT framework to the state-of-the-art Transformer and introduce a more flexible approximate posterior based on normalizing flows. We demonstrate the efficacy of our proposal under both in-domain and out-of-domain conditions, significantly outperforming strong baselines.

* To appear in 2020 Association for Computational Linguistics (ACL) as a short paper

Via

Access Paper or Ask Questions