Matthew Bozoukov

Are LLM Evaluators Really Narcissists? Sanity Checking Self-Preference Evaluations

Jan 30, 2026
Via arXiv

Minimal and Mechanistic Conditions for Behavioral Self-Awareness in LLMs

Nov 06, 2025
Via arXiv

Breaking the Mirror: Activation-Based Mitigation of Self-Preference in LLM Evaluators

Sep 03, 2025
Via arXiv

Uncovering Branch specialization in InceptionV1 using k sparse autoencoders

Apr 14, 2025
Via arXiv