Picture for Shuhang Lin

Shuhang Lin

FactTest: Factuality Testing in Large Language Models with Statistical Guarantees

Add code
Nov 04, 2024
Viaarxiv icon

Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities

Add code
Jun 04, 2024
Viaarxiv icon

BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis

Add code
Apr 23, 2024
Viaarxiv icon