Picture for Sander Land

Sander Land

Understanding Likelihood Over-optimisation in Direct Alignment Algorithms

Add code
Oct 15, 2024
Viaarxiv icon

Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Add code
May 08, 2024
Viaarxiv icon