Picture for Alexander Wei

Alexander Wei

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation

Add code
Jun 28, 2024
Figure 1 for Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation
Figure 2 for Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation
Figure 3 for Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation
Figure 4 for Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation
Viaarxiv icon

Jailbroken: How Does LLM Safety Training Fail?

Add code
Jul 05, 2023
Viaarxiv icon

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

Add code
Jul 13, 2022
Figure 1 for TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Figure 2 for TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Figure 3 for TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Figure 4 for TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels
Viaarxiv icon

More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize

Add code
Mar 11, 2022
Figure 1 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Figure 2 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Figure 3 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Figure 4 for More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize
Viaarxiv icon

Predicting Out-of-Distribution Error with the Projection Norm

Add code
Feb 11, 2022
Figure 1 for Predicting Out-of-Distribution Error with the Projection Norm
Figure 2 for Predicting Out-of-Distribution Error with the Projection Norm
Figure 3 for Predicting Out-of-Distribution Error with the Projection Norm
Figure 4 for Predicting Out-of-Distribution Error with the Projection Norm
Viaarxiv icon

Learning Equilibria in Matching Markets from Bandit Feedback

Add code
Aug 19, 2021
Figure 1 for Learning Equilibria in Matching Markets from Bandit Feedback
Figure 2 for Learning Equilibria in Matching Markets from Bandit Feedback
Viaarxiv icon

Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms

Add code
Oct 22, 2020
Figure 1 for Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms
Figure 2 for Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms
Viaarxiv icon