Victor Legrand

Mitigating Text Toxicity with Counterfactual Generation

May 16, 2024
Via arXiv

Evaluating self-attention interpretability through human-grounded experimental protocol

Mar 27, 2023
Via arXiv