Xiaohua Wang

Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

Sep 25, 2024
Via arXiv

Power-LLaVA: Large Language and Vision Assistant for Power Transmission Line Inspection

Jul 27, 2024
Via arXiv

Searching for Best Practices in Retrieval-Augmented Generation

Jul 01, 2024
Via arXiv

Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query Refinement

Jul 01, 2024
Via arXiv

Towards Biologically Plausible Computing: A Comprehensive Comparison

Jun 23, 2024
Via arXiv

Promoting Data and Model Privacy in Federated Learning through Quantized LoRA

Jun 16, 2024
Via arXiv

Advancing Parameter Efficiency in Fine-tuning via Representation Editing

Feb 28, 2024
Via arXiv

Aligning Large Language Models with Human Preferences through Representation Engineering

Dec 26, 2023
Via arXiv

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Dec 14, 2023
Via arXiv

FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models

Nov 16, 2023
Via arXiv