Picture for Yebowen Hu

Yebowen Hu

STRUX: An LLM for Decision-Making with Structured Explanations

Add code
Oct 16, 2024
Viaarxiv icon

DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning

Add code
Oct 02, 2024
Figure 1 for DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning
Figure 2 for DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning
Figure 3 for DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning
Figure 4 for DeFine: Enhancing LLM Decision-Making with Factor Profiles and Analogical Reasoning
Viaarxiv icon

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Add code
Jun 17, 2024
Figure 1 for When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives
Figure 2 for When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives
Figure 3 for When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives
Figure 4 for When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives
Viaarxiv icon

BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models

Add code
Jun 03, 2024
Figure 1 for BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Figure 2 for BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Figure 3 for BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Figure 4 for BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models
Viaarxiv icon

Can Large Language Models do Analytical Reasoning?

Add code
Mar 06, 2024
Viaarxiv icon

SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs

Add code
Feb 15, 2024
Viaarxiv icon

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Add code
Jan 07, 2024
Viaarxiv icon

MeetingBank: A Benchmark Dataset for Meeting Summarization

Add code
May 27, 2023
Viaarxiv icon

Analyzing Influential Factors in Human Preference Judgments via GPT-4

Add code
May 24, 2023
Viaarxiv icon