Picture for Benjamin I. P. Rubinstein

Benjamin I. P. Rubinstein

CERT-ED: Certifiably Robust Text Classification for Edit Distance

Add code
Aug 01, 2024
Viaarxiv icon

Adaptive Data Analysis for Growing Data

Add code
May 22, 2024
Viaarxiv icon

SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks

Add code
May 19, 2024
Viaarxiv icon

RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing

Add code
May 14, 2024
Figure 1 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Figure 2 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Figure 3 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Figure 4 for RS-Reg: Probabilistic and Robust Certified Regression Through Randomized Smoothing
Viaarxiv icon

Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Add code
Apr 30, 2024
Figure 1 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Figure 2 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Figure 3 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Figure 4 for Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning
Viaarxiv icon

Backdoor Attack on Multilingual Machine Translation

Add code
Apr 03, 2024
Viaarxiv icon

It's Simplex! Disaggregating Measures to Improve Certified Robustness

Add code
Sep 20, 2023
Figure 1 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Figure 2 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Figure 3 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Figure 4 for It's Simplex! Disaggregating Measures to Improve Certified Robustness
Viaarxiv icon

Enhancing the Antidote: Improved Pointwise Certifications against Poisoning Attacks

Add code
Aug 15, 2023
Viaarxiv icon

Exploiting Certified Defences to Attack Randomised Smoothing

Add code
Feb 09, 2023
Viaarxiv icon

Certified Robustness of Learning-based Static Malware Detectors

Add code
Jan 31, 2023
Viaarxiv icon