Picture for Yauwai Yim

Yauwai Yim

Persona Knowledge-Aligned Prompt Tuning Method for Online Debate

Add code
Oct 05, 2024
Viaarxiv icon

ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities

Add code
Oct 04, 2024
Viaarxiv icon

Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information

Add code
Aug 05, 2024
Viaarxiv icon

CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge

Add code
Jul 30, 2024
Viaarxiv icon

Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction

Add code
Apr 22, 2024
Viaarxiv icon

NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding

Add code
Apr 21, 2024
Viaarxiv icon