Picture for Chengyuan Yao

Chengyuan Yao

Reward Shaping to Mitigate Reward Hacking in RLHF

Add code
Feb 26, 2025
Viaarxiv icon

Improving Robust Fairness via Balance Adversarial Training

Add code
Sep 15, 2022
Figure 1 for Improving Robust Fairness via Balance Adversarial Training
Figure 2 for Improving Robust Fairness via Balance Adversarial Training
Figure 3 for Improving Robust Fairness via Balance Adversarial Training
Figure 4 for Improving Robust Fairness via Balance Adversarial Training
Viaarxiv icon

Automated Discovery of Adaptive Attacks on Adversarial Defenses

Add code
Feb 27, 2021
Figure 1 for Automated Discovery of Adaptive Attacks on Adversarial Defenses
Figure 2 for Automated Discovery of Adaptive Attacks on Adversarial Defenses
Figure 3 for Automated Discovery of Adaptive Attacks on Adversarial Defenses
Figure 4 for Automated Discovery of Adaptive Attacks on Adversarial Defenses
Viaarxiv icon

Deep Learning for Post-Processing Ensemble Weather Forecasts

Add code
May 18, 2020
Figure 1 for Deep Learning for Post-Processing Ensemble Weather Forecasts
Figure 2 for Deep Learning for Post-Processing Ensemble Weather Forecasts
Figure 3 for Deep Learning for Post-Processing Ensemble Weather Forecasts
Figure 4 for Deep Learning for Post-Processing Ensemble Weather Forecasts
Viaarxiv icon