Chenghao Deng

EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?

Oct 06, 2024
Via arXiv

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Sep 27, 2024
Via arXiv

Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies

Feb 20, 2024
Via arXiv

Benchmarking the Robustness of Image Watermarks

Jan 22, 2024
Via arXiv

Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making

Sep 07, 2023
Via arXiv