Kunhe Yang

Learning Local Stackelberg Equilibria from Repeated Interactions with a Learning Agent

Oct 26, 2025
Via arXiv

Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?

May 29, 2025
Via arXiv

Is Knowledge Power? On the (Im)possibility of Learning from Strategic Interaction

Aug 15, 2024
Via arXiv

Truthfulness of Calibration Measures

Jul 19, 2024
Via arXiv

Strategic Littlestone Dimension: Improved Bounds on Online Strategic Classification

Jul 16, 2024
Via arXiv

Calibrated Stackelberg Games: Learning Optimal Commitments Against Calibrated Agents

Jun 05, 2023
Via arXiv

Fundamental Bounds on Online Strategic Classification

Feb 23, 2023
Via arXiv

Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

Nov 01, 2022
Via arXiv

Oracle-Efficient Online Learning for Beyond Worst-Case Adversaries

Mar 08, 2022
Via arXiv

$Q$-learning with Logarithmic Regret

Jun 16, 2020
Via arXiv