Sachin Vashistha

PragWorld: A Benchmark Evaluating LLMs' Local World Model under Minimal Linguistic Alterations and Conversational Dynamics

Nov 17, 2025
Via arXiv

SMAB: MAB based word Sensitivity Estimation Framework and its Applications in Adversarial Text Generation

Feb 10, 2025
Via arXiv

Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks

May 24, 2023
Via arXiv