Picture for Michael Oesterle

Michael Oesterle

Evolution of SAE Features Across Layers in LLMs

Add code
Oct 11, 2024
Viaarxiv icon

GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems

Add code
Apr 14, 2024
Viaarxiv icon

Beyond Single-Feature Importance with ICECREAM

Add code
Jul 19, 2023
Viaarxiv icon