Picture for Jiashu Yao

Jiashu Yao

HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple Devices

Add code
May 26, 2025
Viaarxiv icon

ReFF: Reinforcing Format Faithfulness in Language Models across Varied Tasks

Add code
Dec 12, 2024
Viaarxiv icon

Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation

Add code
Oct 22, 2024
Figure 1 for Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Figure 2 for Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Figure 3 for Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Figure 4 for Optimizing Chain-of-Thought Reasoning: Tackling Arranging Bottleneck via Plan Augmentation
Viaarxiv icon

FAME: Towards Factual Multi-Task Model Editing

Add code
Oct 07, 2024
Figure 1 for FAME: Towards Factual Multi-Task Model Editing
Figure 2 for FAME: Towards Factual Multi-Task Model Editing
Figure 3 for FAME: Towards Factual Multi-Task Model Editing
Figure 4 for FAME: Towards Factual Multi-Task Model Editing
Viaarxiv icon

Deterministic Reversible Data Augmentation for Neural Machine Translation

Add code
Jun 04, 2024
Viaarxiv icon