Picture for Yusen Zhang

Yusen Zhang

Verbosity $ eq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models

Add code
Nov 12, 2024
Viaarxiv icon

AAAR-1.0: Assessing AI's Potential to Assist Research

Add code
Oct 29, 2024
Viaarxiv icon

Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models

Add code
Jun 10, 2024
Viaarxiv icon

Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Add code
Jun 04, 2024
Viaarxiv icon

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

Add code
Jun 03, 2024
Viaarxiv icon

Evaluating LLMs at Detecting Errors in LLM Responses

Add code
Apr 04, 2024
Viaarxiv icon

A General Benchmark Framework is Dynamic Graph Neural Network Need

Add code
Jan 12, 2024
Viaarxiv icon

Fair Abstractive Summarization of Diverse Perspectives

Add code
Nov 14, 2023
Viaarxiv icon

FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization

Add code
Nov 08, 2023
Viaarxiv icon

XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations

Add code
Jun 07, 2023
Viaarxiv icon