Picture for Ilia Shumailov

Ilia Shumailov

Hardware and Software Platform Inference

Add code
Nov 07, 2024
Viaarxiv icon

Stealing User Prompts from Mixture of Experts

Add code
Oct 30, 2024
Viaarxiv icon

Measuring memorization through probabilistic discoverable extraction

Add code
Oct 25, 2024
Viaarxiv icon

Operationalizing Contextual Integrity in Privacy-Conscious Assistants

Add code
Aug 05, 2024
Figure 1 for Operationalizing Contextual Integrity in Privacy-Conscious Assistants
Figure 2 for Operationalizing Contextual Integrity in Privacy-Conscious Assistants
Figure 3 for Operationalizing Contextual Integrity in Privacy-Conscious Assistants
Figure 4 for Operationalizing Contextual Integrity in Privacy-Conscious Assistants
Viaarxiv icon

A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Add code
Jul 02, 2024
Viaarxiv icon

UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI

Add code
Jun 27, 2024
Viaarxiv icon

Measuring memorization in RLHF for code completion

Add code
Jun 17, 2024
Viaarxiv icon

Beyond Slow Signs in High-fidelity Model Extraction

Add code
Jun 14, 2024
Viaarxiv icon

Locking Machine Learning Models into Hardware

Add code
May 31, 2024
Viaarxiv icon

Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias

Add code
Mar 12, 2024
Viaarxiv icon