Milad Nasr

Privacy Auditing of Large Language Models

Mar 09, 2025
Via arXiv

AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses

Mar 03, 2025
Via arXiv

Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards

Jan 13, 2025
Via arXiv

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Dec 10, 2024
Via arXiv

SoK: Watermarking for AI-Generated Content

Nov 27, 2024
Via arXiv

Remote Timing Attacks on Efficient Language Model Inference

Oct 22, 2024
Via arXiv

The Last Iterate Advantage: Empirical Auditing and Principled Heuristic Analysis of Differentially Private SGD

Oct 10, 2024
Via arXiv

Avoiding Generative Model Writer's Block With Embedding Nudging

Aug 28, 2024
Via arXiv

Phantom: General Trigger Attacks on Retrieval Augmented Language Generation

May 30, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv