Bosi Wen

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Jul 04, 2024
Via arXiv

ToMBench: Benchmarking Theory of Mind in Large Language Models

Feb 23, 2024
Via arXiv

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Dec 05, 2023
Via arXiv

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Nov 30, 2023
Via arXiv

CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models

Nov 28, 2023
Via arXiv

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training

Aug 03, 2021
Via arXiv