Yauwai Yim

Persona Knowledge-Aligned Prompt Tuning Method for Online Debate

Oct 05, 2024

ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities

Oct 04, 2024

Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information

Aug 05, 2024

CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge

Jul 30, 2024

Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction

Apr 22, 2024

NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding

Apr 21, 2024