Picture for Jett Janiak

Jett Janiak

Chain-of-Thought Reasoning In The Wild Is Not Always Faithful

Add code
Mar 13, 2025
Viaarxiv icon

Characterizing stable regions in the residual stream of LLMs

Add code
Sep 26, 2024
Viaarxiv icon

An Adversarial Example for Direct Logit Attribution: Memory Management in gelu-4l

Add code
Oct 14, 2023
Viaarxiv icon