Picture for Martina G. Vilas

Martina G. Vilas

The Computational Complexity of Circuit Discovery for Inner Interpretability

Add code
Oct 10, 2024
Viaarxiv icon

Limited but consistent gains in adversarial robustness by co-training object recognition models with human EEG

Add code
Sep 05, 2024
Viaarxiv icon

Position Paper: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience

Add code
Jun 03, 2024
Viaarxiv icon

Analyzing Vision Transformers for Image Classification in Class Embedding Space

Add code
Oct 29, 2023
Viaarxiv icon