Chengyuan Yao

STEP3-VL-10B Technical Report

Jan 15, 2026
Via arXiv

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Jan 09, 2026
Via arXiv

Reward Shaping to Mitigate Reward Hacking in RLHF

Feb 26, 2025
Via arXiv

Improving Robust Fairness via Balance Adversarial Training

Sep 15, 2022
Via arXiv

Automated Discovery of Adaptive Attacks on Adversarial Defenses

Feb 27, 2021
Via arXiv

Deep Learning for Post-Processing Ensemble Weather Forecasts

May 18, 2020
Via arXiv