Qiongyu Li

UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions

Jun 18, 2024