Picture for Claudia Tang

Claudia Tang

VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation

Add code
Jun 26, 2024
Figure 1 for VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Figure 2 for VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Figure 3 for VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Figure 4 for VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation
Viaarxiv icon