
Jegwang Ryu

Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher

Jun 26, 2024

Hyunjong Ok, Jegwang Ryu, Jaeho Lee


Abstract: How can sLLMs efficiently use the supervision of LLMs to improve their generative quality? This question has been well studied in settings with no restriction on how much LLM supervision one can use, giving rise to many decoding algorithms that exploit supervision without further training. However, it remains unclear what strategy is effective under limited supervision, where we assume that LLMs can generate no more than a few tokens. To this end, we develop an algorithm that aggregates the sLLM and LLM predictions on the initial tokens, so that the generated tokens more accurately condition the subsequent generation by the sLLM alone. Critically, we find that it is essential to adaptively overtrust or disregard the LLM prediction based on the confidence of the sLLM. Through experiments on a wide range of models and datasets, we demonstrate that our method consistently improves over conventional decoding strategies.
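The abstract does not give the aggregation rule, so the following Python sketch is only an illustration of the general idea, not the authors' algorithm. It assumes entropy as the sLLM confidence measure, a fixed threshold, and a log-linear extrapolation with weight `alpha` as the "overtrust" step; the names `aggregate`, `decode`, `entropy_threshold`, `alpha`, and `budget` are all hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def aggregate(small_probs, large_probs, entropy_threshold=0.5, alpha=0.5, eps=1e-12):
    """Combine sLLM and LLM next-token distributions (illustrative rule only).

    Assumption: if the sLLM is confident (low entropy), the LLM is disregarded.
    Otherwise the LLM is "overtrusted" by extrapolating past it in log space:
    score = (1 + alpha) * log p_LLM - alpha * log p_sLLM.
    """
    if entropy(small_probs) < entropy_threshold:
        return list(small_probs)
    scores = [
        (1 + alpha) * math.log(pl + eps) - alpha * math.log(ps + eps)
        for ps, pl in zip(small_probs, large_probs)
    ]
    # Softmax over the scores, shifted by the max for numerical stability.
    top = max(scores)
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def decode(small_step, large_step, prompt, budget, max_new_tokens):
    """Greedy decoding where the LLM may be queried only for the first `budget` tokens.

    `small_step` and `large_step` are hypothetical callables that map a token
    list to a next-token probability vector.
    """
    tokens = list(prompt)
    for t in range(max_new_tokens):
        probs = small_step(tokens)
        if t < budget:
            probs = aggregate(probs, large_step(tokens))
        tokens.append(max(range(len(probs)), key=probs.__getitem__))
    return tokens
```

In this sketch, once the budget is exhausted the sLLM continues alone, conditioned on the tokens chosen with LLM help, which mirrors the limited-supervision setting the abstract describes.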

* preprint
