Qiongkai Xu

Cut the Deadwood Out: Post-Training Model Purification with Selective Module Substitution

Dec 29, 2024
Via arXiv

Overview of the 2024 ALTA Shared Task: Detect Automatic AI-Generated Sentences for Human-AI Hybrid Articles

Dec 19, 2024
Via arXiv

WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks

Aug 29, 2024
Via arXiv

IDT: Dual-Task Adversarial Attacks for Privacy Protection

Jun 28, 2024
Via arXiv

NAP^2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human

Jun 06, 2024
Via arXiv

Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients

Jun 03, 2024
Via arXiv

SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks

May 19, 2024
Via arXiv

Transferring Troubles: Cross-Lingual Transferability of Backdoor Attacks in LLMs with Instruction Tuning

Apr 30, 2024
Via arXiv

Attacks on Third-Party APIs of Large Language Models

Apr 24, 2024
Via arXiv

Backdoor Attack on Multilingual Machine Translation

Apr 03, 2024
Via arXiv