Picture for Lingrui Mei

Lingrui Mei

You Know What I'm Saying -- Jailbreak Attack via Implicit Reference

Add code
Oct 04, 2024
Viaarxiv icon

HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router

Add code
Oct 03, 2024
Figure 1 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Figure 2 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Figure 3 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Figure 4 for HiddenGuard: Fine-Grained Safe Generation with Specialized Representation Router
Viaarxiv icon

StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models

Add code
Sep 16, 2024
Viaarxiv icon

Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities

Add code
Jun 18, 2024
Viaarxiv icon

"Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jailbreak

Add code
Jun 17, 2024
Viaarxiv icon

Decoding by Contrasting Knowledge: Enhancing LLMs' Confidence on Edited Facts

Add code
May 21, 2024
Viaarxiv icon

Is Factuality Decoding a Free Lunch for LLMs? Evaluation on Knowledge Editing Benchmark

Add code
Mar 30, 2024
Viaarxiv icon

Graph Descriptive Order Improves Reasoning with Large Language Model

Add code
Feb 24, 2024
Viaarxiv icon

LPNL: Scalable Link Prediction with Large Language Models

Add code
Feb 06, 2024
Viaarxiv icon

SLANG: New Concept Comprehension of Large Language Models

Add code
Feb 05, 2024
Viaarxiv icon