Picture for Fan Yin

Fan Yin

Evaluating Human Alignment and Model Faithfulness of LLM Rationale

Add code
Jun 28, 2024
Viaarxiv icon

Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation

Add code
Jun 19, 2024
Viaarxiv icon

Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller

Add code
Jun 04, 2024
Viaarxiv icon

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

Add code
May 30, 2024
Viaarxiv icon

Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension

Add code
Feb 28, 2024
Viaarxiv icon

Contrastive Instruction Tuning

Add code
Feb 17, 2024
Viaarxiv icon

Prompt-Driven LLM Safeguarding via Directed Representation Optimization

Add code
Jan 31, 2024
Figure 1 for Prompt-Driven LLM Safeguarding via Directed Representation Optimization
Figure 2 for Prompt-Driven LLM Safeguarding via Directed Representation Optimization
Figure 3 for Prompt-Driven LLM Safeguarding via Directed Representation Optimization
Figure 4 for Prompt-Driven LLM Safeguarding via Directed Representation Optimization
Viaarxiv icon

Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks

Add code
Nov 01, 2023
Viaarxiv icon

Should I Stop or Should I Go: Early Stopping with Heterogeneous Populations

Add code
Jun 26, 2023
Viaarxiv icon

Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

Add code
Jun 01, 2023
Viaarxiv icon