Picture for Varshini Subhash

Varshini Subhash

Why do universal adversarial attacks work on large language models?: Geometry might be the answer

Add code
Sep 01, 2023
Viaarxiv icon

Does the explanation satisfy your needs?: A unified view of properties of explanations

Add code
Nov 10, 2022
Viaarxiv icon