Picture for Raz Lapid

Raz Lapid

BenchOverflow: Measuring Overflow in Large Language Models via Plain-Text Prompts

Add code
Jan 13, 2026
Viaarxiv icon

Activation Steering for Masked Diffusion Language Models

Add code
Dec 30, 2025
Viaarxiv icon

Breaking Audio Large Language Models by Attacking Only the Encoder: A Universal Targeted Latent-Space Audio Attack

Add code
Dec 29, 2025
Viaarxiv icon

You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations

Add code
Nov 09, 2025
Figure 1 for You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations
Figure 2 for You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations
Viaarxiv icon

Don't Lag, RAG: Training-Free Adversarial Detection Using RAG

Add code
Apr 07, 2025
Viaarxiv icon

Pulling Back the Curtain: Unsupervised Adversarial Detection via Contrastive Auxiliary Networks

Add code
Feb 13, 2025
Viaarxiv icon

On the Robustness of Kolmogorov-Arnold Networks: An Adversarial Perspective

Add code
Aug 25, 2024
Viaarxiv icon

Fortify the Guardian, Not the Treasure: Resilient Adversarial Detectors

Add code
Apr 18, 2024
Viaarxiv icon

XAI-Based Detection of Adversarial Attacks on Deepfake Detectors

Add code
Mar 05, 2024
Figure 1 for XAI-Based Detection of Adversarial Attacks on Deepfake Detectors
Figure 2 for XAI-Based Detection of Adversarial Attacks on Deepfake Detectors
Figure 3 for XAI-Based Detection of Adversarial Attacks on Deepfake Detectors
Figure 4 for XAI-Based Detection of Adversarial Attacks on Deepfake Detectors
Viaarxiv icon

Open Sesame! Universal Black Box Jailbreaking of Large Language Models

Add code
Sep 17, 2023
Figure 1 for Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Figure 2 for Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Figure 3 for Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Figure 4 for Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Viaarxiv icon