K-Sort Arena: Efficient and Reliable Benchmarking for Generative Models via K-wise Human Preferences

Aug 26, 2024


View paper on arXiv
