Jamie Hayes

$(\varepsilon, \delta)$ Considered Harmful: Best Practices for Reporting Differential Privacy Guarantees

Mar 13, 2025

Interpreting the Repeated Token Phenomenon in Large Language Models

Mar 11, 2025

Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice

Dec 09, 2024

To Shuffle or not to Shuffle: Auditing DP-SGD with Shuffling

Nov 15, 2024

Stealing User Prompts from Mixture of Experts

Oct 30, 2024

Measuring Memorization Through Probabilistic Discoverable Extraction

Oct 25, 2024

The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD

Oct 10, 2024

Imagen 3

Aug 13, 2024

UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI

Jun 27, 2024

Measuring memorization in RLHF for code completion

Jun 17, 2024