Arush Tagade

The SaTML '24 CNN Interpretability Competition: New Innovations for Concept-Level Interpretability

Apr 03, 2024
Via arXiv

Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation

Nov 06, 2023
Via arXiv

Prototype Generation: Robust Feature Visualisation for Data Independent Interpretability

Sep 29, 2023
Via arXiv

Why do CNNs excel at feature extraction? A mathematical explanation

Jul 03, 2023
Via arXiv