Edoardo Mosca

Simpler becomes Harder: Do LLMs Exhibit a Coherent Behavior on Simplified Corpora?

Apr 10, 2024
Via arXiv

IFAN: An Explainability-Focused Interaction Framework for Humans and NLP Models

Mar 06, 2023
Via arXiv

"That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks

Apr 10, 2022
Via arXiv