Picture for Qiongkai Xu

Qiongkai Xu

WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks

Add code
Aug 29, 2024
Viaarxiv icon

IDT: Dual-Task Adversarial Attacks for Privacy Protection

Add code
Jun 28, 2024
Viaarxiv icon

NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human

Add code
Jun 06, 2024
Figure 1 for NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Figure 2 for NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Figure 3 for NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Figure 4 for NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
Viaarxiv icon

Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients

Add code
Jun 03, 2024
Viaarxiv icon

SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks

Add code
May 19, 2024
Viaarxiv icon

Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Add code
Apr 30, 2024
Viaarxiv icon

Attacks on Third-Party APIs of Large Language Models

Add code
Apr 24, 2024
Viaarxiv icon

Backdoor Attack on Multilingual Machine Translation

Add code
Apr 03, 2024
Viaarxiv icon

WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection

Add code
Mar 03, 2024
Viaarxiv icon

Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge

Add code
Feb 29, 2024
Viaarxiv icon