Picture for Zhiwei Steven Wu

Zhiwei Steven Wu

What Fits (Into Few Tokens) Doesn't Overfit: Compression and Generalization in ML Research Agents

Add code
Jun 09, 2026
Viaarxiv icon

From Curiosity to Caution: Mitigating Reward Hacking for Best-of-N with Pessimism

Add code
Apr 06, 2026
Viaarxiv icon

Back to Blackwell: Closing the Loop on Intransitivity in Multi-Objective Preference Fine-Tuning

Add code
Feb 22, 2026
Viaarxiv icon

Rubrics as an Attack Surface: Stealthy Preference Drift in LLM Judges

Add code
Feb 14, 2026
Viaarxiv icon

Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning

Add code
Jan 26, 2026
Viaarxiv icon

Persona-Augmented Benchmarking: Evaluating LLMs Across Diverse Writing Styles

Add code
Jul 29, 2025
Viaarxiv icon

Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks

Add code
Jul 03, 2025
Figure 1 for Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Figure 2 for Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Figure 3 for Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Figure 4 for Measurement as Bricolage: Examining How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Viaarxiv icon

Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference

Add code
Jun 18, 2025
Figure 1 for Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference
Figure 2 for Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference
Figure 3 for Enhancing One-run Privacy Auditing with Quantile Regression-Based Membership Inference
Viaarxiv icon

Membership Inference Attacks for Unseen Classes

Add code
Jun 06, 2025
Viaarxiv icon

Breaking the Gold Standard: Extracting Forgotten Data under Exact Unlearning in Large Language Models

Add code
May 30, 2025
Viaarxiv icon