Hongru Wang

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Oct 21, 2024
Via arXiv

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Oct 21, 2024
Via arXiv

MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models

Oct 16, 2024
Via arXiv

Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA

Oct 08, 2024
Via arXiv

SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack

Jul 02, 2024
Via arXiv

Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization

Jun 17, 2024
Via arXiv

OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst

Jun 14, 2024
Via arXiv

AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation

May 29, 2024
Via arXiv

Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges

May 17, 2024
Via arXiv

Knowledge Conflicts for LLMs: A Survey

Mar 13, 2024
Via arXiv