Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Goh Siow Mong

How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Feb 25, 2024

Wei Jie Yeo, Ranjan Satapathy, Goh Siow Mong, Rick, Erik Cambria

Figure 1 for How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Figure 2 for How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Figure 3 for How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Figure 4 for How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Abstract:Prompt Engineering has garnered significant attention for enhancing the performance of large language models across a multitude of tasks. Techniques such as the Chain-of-Thought not only bolster task performance but also delineate a clear trajectory of reasoning steps, offering a tangible form of explanation for the audience. Prior works on interpretability assess the reasoning chains yielded by Chain-of-Thought solely along a singular axis, namely faithfulness. We present a comprehensive and multifaceted evaluation of interpretability, examining not only faithfulness but also robustness and utility across multiple commonsense reasoning benchmarks. Likewise, our investigation is not confined to a single prompting technique; it expansively covers a multitude of prevalent prompting techniques employed in large language models, thereby ensuring a wide-ranging and exhaustive evaluation. In addition, we introduce a simple interpretability alignment technique, termed Self-Entailment-Alignment Chain-of-thought, that yields more than 70\% improvements across multiple dimensions of interpretability. Code is available at https://github.com/wj210/CoT_interpretability

* Under review

Via

Access Paper or Ask Questions