Picture for Oliver J. Sutton

Oliver J. Sutton

Stealth edits for provably fixing or attacking large language models

Add code
Jun 18, 2024
Figure 1 for Stealth edits for provably fixing or attacking large language models
Figure 2 for Stealth edits for provably fixing or attacking large language models
Figure 3 for Stealth edits for provably fixing or attacking large language models
Figure 4 for Stealth edits for provably fixing or attacking large language models
Viaarxiv icon

Weakly Supervised Learners for Correction of AI Errors with Provable Performance Guarantees

Add code
Feb 06, 2024
Viaarxiv icon

How adversarial attacks can disrupt seemingly stable accurate classifiers

Add code
Sep 07, 2023
Figure 1 for How adversarial attacks can disrupt seemingly stable accurate classifiers
Figure 2 for How adversarial attacks can disrupt seemingly stable accurate classifiers
Figure 3 for How adversarial attacks can disrupt seemingly stable accurate classifiers
Figure 4 for How adversarial attacks can disrupt seemingly stable accurate classifiers
Viaarxiv icon

Towards a mathematical understanding of learning from few examples with nonlinear feature maps

Add code
Nov 07, 2022
Viaarxiv icon