Picture for Deng Cai

Deng Cai

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Add code
Oct 24, 2024
Viaarxiv icon

A Survey on the Honesty of Large Language Models

Add code
Sep 27, 2024
Figure 1 for A Survey on the Honesty of Large Language Models
Figure 2 for A Survey on the Honesty of Large Language Models
Figure 3 for A Survey on the Honesty of Large Language Models
Figure 4 for A Survey on the Honesty of Large Language Models
Viaarxiv icon

From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning

Add code
Sep 03, 2024
Viaarxiv icon

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems

Add code
Jul 15, 2024
Viaarxiv icon

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

Add code
Jul 13, 2024
Viaarxiv icon

GLBench: A Comprehensive Benchmark for Graph with Large Language Models

Add code
Jul 11, 2024
Viaarxiv icon

Not All Preference Pairs Are Created Equal: A Recipe for Annotation-Efficient Iterative Preference Learning

Add code
Jun 25, 2024
Viaarxiv icon

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Add code
Jun 24, 2024
Viaarxiv icon

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Add code
Jun 14, 2024
Viaarxiv icon

On the Worst Prompt Performance of Large Language Models

Add code
Jun 08, 2024
Viaarxiv icon