Picture for Victor Legrand

Victor Legrand

Mitigating Text Toxicity with Counterfactual Generation

Add code
May 16, 2024
Viaarxiv icon

Evaluating self-attention interpretability through human-grounded experimental protocol

Add code
Mar 27, 2023
Viaarxiv icon