Picture for Valentyn Boreiko

Valentyn Boreiko

A Realistic Threat Model for Large Language Model Jailbreaks

Add code
Oct 21, 2024
Viaarxiv icon

How much can we forget about Data Contamination?

Add code
Oct 04, 2024
Viaarxiv icon

Identification of Fine-grained Systematic Errors via Controlled Scene Generation

Add code
Apr 10, 2024
Viaarxiv icon

Generating Realistic Counterfactuals for Retinal Fundus and OCT Images using Diffusion Models

Add code
Dec 04, 2023
Viaarxiv icon

Identifying Systematic Errors in Object Detectors with the SCROD Pipeline

Add code
Sep 23, 2023
Viaarxiv icon

Identification of Systematic Errors of Image Classifiers on Rare Subgroups

Add code
Mar 09, 2023
Viaarxiv icon

Spurious Features Everywhere -- Large-Scale Detection of Harmful Spurious Features in ImageNet

Add code
Dec 09, 2022
Viaarxiv icon

Sparse Visual Counterfactual Explanations in Image Space

Add code
May 16, 2022
Figure 1 for Sparse Visual Counterfactual Explanations in Image Space
Figure 2 for Sparse Visual Counterfactual Explanations in Image Space
Figure 3 for Sparse Visual Counterfactual Explanations in Image Space
Figure 4 for Sparse Visual Counterfactual Explanations in Image Space
Viaarxiv icon