Tianshuo Cong

The Benchmark Illusion: Pruned LLMs Can Pass Multiple Choice but Fail to Answer

Jun 16, 2026
Via arXiv

Evaluating Implicit Regulatory Compliance in LLM Tool Invocation via Logic-Guided Synthesis

Jan 13, 2026
Via arXiv

GRPO Privacy Is at Risk: A Membership Inference Attack Against Reinforcement Learning With Verifiable Rewards

Nov 18, 2025
Via arXiv

FragFake: A Dataset for Fine-Grained Detection of Edited Images with Vision Language Models

May 21, 2025
Via arXiv

Behind the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models

Feb 28, 2025
Via arXiv

SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning

Feb 06, 2025
Via arXiv

CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers

Dec 26, 2024
Via arXiv

Jailbreak Attacks and Defenses Against Large Language Models: A Survey

Jul 05, 2024
Via arXiv

On Evaluating The Performance of Watermarked Machine-Generated Texts Under Adversarial Attacks

Jul 05, 2024
Via arXiv

JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models

Jun 13, 2024
Via arXiv