Picture for Milan Bhan

Milan Bhan

Mitigating Text Toxicity with Counterfactual Generation

Add code
May 16, 2024
Figure 1 for Mitigating Text Toxicity with Counterfactual Generation
Figure 2 for Mitigating Text Toxicity with Counterfactual Generation
Figure 3 for Mitigating Text Toxicity with Counterfactual Generation
Figure 4 for Mitigating Text Toxicity with Counterfactual Generation
Viaarxiv icon

Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations

Add code
Feb 19, 2024
Viaarxiv icon

TIGTEC : Token Importance Guided TExt Counterfactuals

Add code
Apr 24, 2023
Figure 1 for TIGTEC : Token Importance Guided TExt Counterfactuals
Figure 2 for TIGTEC : Token Importance Guided TExt Counterfactuals
Figure 3 for TIGTEC : Token Importance Guided TExt Counterfactuals
Figure 4 for TIGTEC : Token Importance Guided TExt Counterfactuals
Viaarxiv icon

Evaluating self-attention interpretability through human-grounded experimental protocol

Add code
Mar 27, 2023
Viaarxiv icon