Picture for Jihan Yao

Jihan Yao

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Add code
Oct 14, 2024
Figure 1 for Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Figure 2 for Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Figure 3 for Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Figure 4 for Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Viaarxiv icon

Know Your Limits: A Survey of Abstention in Large Language Models

Add code
Aug 08, 2024
Figure 1 for Know Your Limits: A Survey of Abstention in Large Language Models
Figure 2 for Know Your Limits: A Survey of Abstention in Large Language Models
Figure 3 for Know Your Limits: A Survey of Abstention in Large Language Models
Figure 4 for Know Your Limits: A Survey of Abstention in Large Language Models
Viaarxiv icon

The Art of Refusal: A Survey of Abstention in Large Language Models

Add code
Jul 25, 2024
Figure 1 for The Art of Refusal: A Survey of Abstention in Large Language Models
Figure 2 for The Art of Refusal: A Survey of Abstention in Large Language Models
Figure 3 for The Art of Refusal: A Survey of Abstention in Large Language Models
Figure 4 for The Art of Refusal: A Survey of Abstention in Large Language Models
Viaarxiv icon

Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop

Add code
Feb 16, 2024
Figure 1 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Figure 2 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Figure 3 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Figure 4 for Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop
Viaarxiv icon

POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

Add code
Feb 09, 2024
Figure 1 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Figure 2 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Figure 3 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Figure 4 for POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Viaarxiv icon