Yisroel Mirsky

LeakBoost: Perceptual-Loss-Based Membership Inference Attack

Feb 05, 2026

GAVEL: Towards rule-based safety through activation monitoring

Jan 29, 2026

Love, Lies, and Language Models: Investigating AI's Role in Romance-Baiting Scams

Dec 22, 2025

Trust Me, I Know This Function: Hijacking LLM Static Analysis using Bias

Aug 24, 2025

PaniCar: Securing the Perception of Advanced Driving Assistance Systems Against Emergency Vehicle Lighting

May 08, 2025

Memory Backdoor Attacks on Neural Networks

Nov 21, 2024

Efficient Model Extraction via Boundary Sampling

Oct 20, 2024

PEAS: A Strategy for Crafting Transferable Adversarial Examples

Oct 20, 2024

Are You Human? An Adversarial Benchmark to Expose LLMs

Oct 12, 2024

Back-in-Time Diffusion: Unsupervised Detection of Medical Deepfakes

Jul 21, 2024