Picture for Qiaozhu Mei

Qiaozhu Mei

Shammie

How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games

Add code
Dec 16, 2024
Viaarxiv icon

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

Add code
Jul 23, 2024
Viaarxiv icon

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions

Add code
Jun 17, 2024
Viaarxiv icon

MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

Add code
Jun 10, 2024
Figure 1 for MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows
Figure 2 for MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows
Figure 3 for MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows
Figure 4 for MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows
Viaarxiv icon

Unlocking the `Why' of Buying: Introducing a New Dataset and Benchmark for Purchase Reason and Post-Purchase Experience

Add code
Feb 20, 2024
Viaarxiv icon

PRewrite: Prompt Rewriting with Reinforcement Learning

Add code
Jan 16, 2024
Viaarxiv icon

Bridging the Preference Gap between Retrievers and LLMs

Add code
Jan 13, 2024
Viaarxiv icon

A Turing Test: Are AI Chatbots Behaviorally Similar to Humans?

Add code
Nov 19, 2023
Viaarxiv icon

A Metadata-Driven Approach to Understand Graph Neural Networks

Add code
Oct 30, 2023
Viaarxiv icon

Meta Semantic Template for Evaluation of Large Language Models

Add code
Oct 19, 2023
Viaarxiv icon