Picture for Shojiro Yamabe

Shojiro Yamabe

Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability

Add code
Oct 01, 2025
Viaarxiv icon

Understanding Sensitivity of Differential Attention through the Lens of Adversarial Robustness

Add code
Oct 01, 2025
Viaarxiv icon

MergePrint: Robust Fingerprinting against Merging Large Language Models

Add code
Oct 11, 2024
Figure 1 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Figure 2 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Figure 3 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Figure 4 for MergePrint: Robust Fingerprinting against Merging Large Language Models
Viaarxiv icon

Behavior-Targeted Attack on Reinforcement Learning with Limited Access to Victim's Policy

Add code
Jun 06, 2024
Viaarxiv icon