Picture for Jihan Yao

Jihan Yao

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Add code
Oct 14, 2024
Viaarxiv icon

Know Your Limits: A Survey of Abstention in Large Language Models

Add code
Aug 08, 2024
Viaarxiv icon

The Art of Refusal: A Survey of Abstention in Large Language Models

Add code
Jul 25, 2024
Viaarxiv icon

Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop

Add code
Feb 16, 2024
Figure 1 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Figure 2 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Figure 3 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Figure 4 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Viaarxiv icon

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

Add code
Feb 09, 2024
Viaarxiv icon