Lee Ka Wei Roy

Interpreting Bias in Large Language Models: A Feature-Based Approach

Jun 18, 2024
Via arXiv