Picture for Kangkang Zhao

Kangkang Zhao

SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese

Add code
Feb 02, 2024
Viaarxiv icon

SC-Safety: A Multi-round Open-ended Question Adversarial Safety Benchmark for Large Language Models in Chinese

Add code
Oct 09, 2023
Figure 1 for SC-Safety: A Multi-round Open-ended Question Adversarial Safety Benchmark for Large Language Models in Chinese
Figure 2 for SC-Safety: A Multi-round Open-ended Question Adversarial Safety Benchmark for Large Language Models in Chinese
Figure 3 for SC-Safety: A Multi-round Open-ended Question Adversarial Safety Benchmark for Large Language Models in Chinese
Figure 4 for SC-Safety: A Multi-round Open-ended Question Adversarial Safety Benchmark for Large Language Models in Chinese
Viaarxiv icon

SuperCLUE: A Comprehensive Chinese Large Language Model Benchmark

Add code
Jul 27, 2023
Viaarxiv icon