Picture for Wei Jie Yeo

Wei Jie Yeo

Towards Faithful Natural Language Explanations: A Study Using Activation Patching in Large Language Models

Add code
Oct 18, 2024
Viaarxiv icon

Self-training Large Language Models through Knowledge Detection

Add code
Jun 17, 2024
Viaarxiv icon

How Interpretable are Reasoning Explanations from Prompting Large Language Models?

Add code
Feb 25, 2024
Viaarxiv icon

Plausible Extractive Rationalization through Semi-Supervised Entailment Signal

Add code
Feb 25, 2024
Viaarxiv icon

A Comprehensive Review on Financial Explainable AI

Add code
Sep 21, 2023
Viaarxiv icon