Picture for Chonghua Wang

Chonghua Wang

Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks

Add code
Apr 10, 2024
Viaarxiv icon

BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues

Add code
Oct 20, 2023
Viaarxiv icon