Kejiang Chen

AutoPT: How Far Are We from the End2End Automated Web Penetration Testing?

Nov 02, 2024

A Closer Look at Machine Unlearning for Large Language Models

Oct 10, 2024

Natias: Neuron Attribution based Transferable Image Adversarial Steganography

Sep 08, 2024

Prefix Guidance: A Steering Wheel for Large Language Models to Defend Against Jailbreak Attacks

Aug 22, 2024

Gaussian Shading: Provable Performance-Lossless Image Watermarking for Diffusion Models

Apr 07, 2024

Provably Secure Disambiguating Neural Linguistic Steganography

Mar 26, 2024

Data-Free Hard-Label Robustness Stealing Attack

Dec 12, 2023

Control Risk for Potential Misuse of Artificial Intelligence in Science

Dec 11, 2023

GPT Paternity Test: GPT Generated Text Detection with GPT Genetic Inheritance

May 21, 2023

Watermarking Text Generated by Black-Box Language Models

May 14, 2023