Picture for Shuyu Wei

Shuyu Wei

Taming the Thinker: Conditional Entropy Shaping for Adaptive LLM Reasoning

Add code
May 19, 2026
Viaarxiv icon

Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning

Add code
Feb 01, 2024
Viaarxiv icon

CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models

Add code
Nov 28, 2023
Figure 1 for CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models
Figure 2 for CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models
Figure 3 for CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models
Figure 4 for CDEval: A Benchmark for Measuring the Cultural Dimensions of Large Language Models
Viaarxiv icon