Picture for Linhao Yu

Linhao Yu

Self-Pluralising Culture Alignment for Large Language Models

Add code
Oct 16, 2024
Viaarxiv icon

CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models

Add code
Aug 19, 2024
Viaarxiv icon

LFED: A Literary Fiction Evaluation Dataset for Large Language Models

Add code
May 16, 2024
Viaarxiv icon

OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety

Add code
Mar 18, 2024
Viaarxiv icon

Identifying Multiple Personalities in Large Language Models with External Evaluation

Add code
Feb 22, 2024
Viaarxiv icon

Evaluating Large Language Models: A Comprehensive Survey

Add code
Oct 31, 2023
Viaarxiv icon

M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

Add code
May 21, 2023
Viaarxiv icon