Picture for Shuhang Lin

Shuhang Lin

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

Add code
Jan 05, 2025
Viaarxiv icon

FactTest: Factuality Testing in Large Language Models with Statistical Guarantees

Add code
Nov 04, 2024
Figure 1 for FactTest: Factuality Testing in Large Language Models with Statistical Guarantees
Figure 2 for FactTest: Factuality Testing in Large Language Models with Statistical Guarantees
Figure 3 for FactTest: Factuality Testing in Large Language Models with Statistical Guarantees
Figure 4 for FactTest: Factuality Testing in Large Language Models with Statistical Guarantees
Viaarxiv icon

Disentangling Logic: The Role of Context in Large Language Model Reasoning Capabilities

Add code
Jun 04, 2024
Viaarxiv icon

BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis

Add code
Apr 23, 2024
Figure 1 for BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
Figure 2 for BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
Figure 3 for BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
Figure 4 for BattleAgent: Multi-modal Dynamic Emulation on Historical Battles to Complement Historical Analysis
Viaarxiv icon