Zhiwei Steven Wu

Position: LLM Unlearning Benchmarks are Weak Measures of Progress

Oct 03, 2024
Via arXiv

Multi-group Uncertainty Quantification for Long-form Text Generation

Jul 25, 2024
Via arXiv

Jogging the Memory of Unlearned Model Through Targeted Relearning Attack

Jun 19, 2024
Via arXiv

Multi-Agent Imitation Learning: Value is Easy, Regret is Hard

Jun 06, 2024
Via arXiv

Orthogonal Causal Calibration

Jun 04, 2024
Via arXiv

Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift

Jun 02, 2024
Via arXiv

Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

May 30, 2024
Via arXiv

Reconciling Model Multiplicity for Downstream Decision Making

May 30, 2024
Via arXiv

Predictive Performance Comparison of Decision Policies Under Confounding

Apr 01, 2024
Via arXiv

Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

Mar 08, 2024
Via arXiv