Yusen Zhang

GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Dec 12, 2024

Coverage-based Fairness in Multi-document Summarization

Dec 11, 2024

VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information

Dec 01, 2024

Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models

Nov 12, 2024

AAAR-1.0: Assessing AI's Potential to Assist Research

Oct 29, 2024

Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models

Jun 10, 2024

Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Jun 04, 2024

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs

Jun 03, 2024

Evaluating LLMs at Detecting Errors in LLM Responses

Apr 04, 2024

A General Benchmark Framework is Dynamic Graph Neural Network Need

Jan 12, 2024