Picture for Hongru Wang

Hongru Wang

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions

Add code
Nov 15, 2024
Figure 1 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 2 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 3 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Figure 4 for Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
Viaarxiv icon

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Add code
Oct 21, 2024
Figure 1 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 2 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 3 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Figure 4 for Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Viaarxiv icon

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Add code
Oct 21, 2024
Figure 1 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 2 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 3 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Figure 4 for Analysing the Residual Stream of Language Models Under Knowledge Conflicts
Viaarxiv icon

MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models

Add code
Oct 16, 2024
Figure 1 for MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
Figure 2 for MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
Figure 3 for MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
Figure 4 for MlingConf: A Comprehensive Study of Multilingual Confidence Estimation on Large Language Models
Viaarxiv icon

Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA

Add code
Oct 08, 2024
Viaarxiv icon

SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack

Add code
Jul 02, 2024
Figure 1 for SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack
Figure 2 for SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack
Figure 3 for SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack
Figure 4 for SoP: Unlock the Power of Social Facilitation for Automatic Jailbreak Attack
Viaarxiv icon

Enhancing Biomedical Knowledge Retrieval-Augmented Generation with Self-Rewarding Tree Search and Proximal Policy Optimization

Add code
Jun 17, 2024
Viaarxiv icon

OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst

Add code
Jun 14, 2024
Figure 1 for OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
Figure 2 for OSPC: Detecting Harmful Memes with Large Language Model as a Catalyst
Viaarxiv icon

AutoCV: Empowering Reasoning with Automated Process Labeling via Confidence Variation

Add code
May 29, 2024
Viaarxiv icon

Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges

Add code
May 17, 2024
Figure 1 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 2 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 3 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 4 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Viaarxiv icon