Picture for Jim Berend

Jim Berend

Atlas-Alignment: Making Interpretability Transferable Across Language Models

Add code
Oct 31, 2025
Viaarxiv icon

From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance

Add code
May 26, 2025
Figure 1 for From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance
Figure 2 for From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance
Figure 3 for From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance
Figure 4 for From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance
Viaarxiv icon

Mechanistic understanding and validation of large AI models with SemanticLens

Add code
Jan 09, 2025
Figure 1 for Mechanistic understanding and validation of large AI models with SemanticLens
Figure 2 for Mechanistic understanding and validation of large AI models with SemanticLens
Viaarxiv icon

Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers

Add code
Dec 09, 2024
Figure 1 for Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
Figure 2 for Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
Figure 3 for Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
Figure 4 for Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
Viaarxiv icon

Layer-wise Feedback Propagation

Add code
Aug 23, 2023
Figure 1 for Layer-wise Feedback Propagation
Figure 2 for Layer-wise Feedback Propagation
Figure 3 for Layer-wise Feedback Propagation
Figure 4 for Layer-wise Feedback Propagation
Viaarxiv icon