Are self-explanations from Large Language Models faithful?

Add code
Jan 17, 2024
Figure 1 for Are self-explanations from Large Language Models faithful?
Figure 2 for Are self-explanations from Large Language Models faithful?
Figure 3 for Are self-explanations from Large Language Models faithful?
Figure 4 for Are self-explanations from Large Language Models faithful?

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: