Sangyu Han

Decompose the model: Mechanistic interpretability in image models with Generalized Integrated Gradients (GIG)

Sep 03, 2024
Via arXiv

Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition

Jan 25, 2024
Via arXiv