Picture for Kailong Wang

Kailong Wang

Efficient and Effective Universal Adversarial Attack against Vision-Language Pre-training Models

Add code
Oct 15, 2024
Viaarxiv icon

GlitchProber: Advancing Effective Detection and Mitigation of Glitch Tokens in Large Language Models

Add code
Aug 09, 2024
Viaarxiv icon

NeuSemSlice: Towards Effective DNN Model Maintenance via Neuron-level Semantic Slicing

Add code
Jul 26, 2024
Viaarxiv icon

Continuous Embedding Attacks via Clipped Inputs in Jailbreaking Large Language Models

Add code
Jul 16, 2024
Viaarxiv icon

Lockpicking LLMs: A Logit-Based Jailbreak Using Token-level Manipulation

Add code
May 20, 2024
Viaarxiv icon

Large Language Models for Cyber Security: A Systematic Literature Review

Add code
May 08, 2024
Viaarxiv icon

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

Add code
Apr 19, 2024
Viaarxiv icon

Beyond Fidelity: Explaining Vulnerability Localization of Learning-based Detectors

Add code
Jan 05, 2024
Viaarxiv icon

Digger: Detecting Copyright Content Mis-usage in Large Language Model Training

Add code
Jan 01, 2024
Viaarxiv icon

Large Language Models for Software Engineering: A Systematic Literature Review

Add code
Sep 12, 2023
Viaarxiv icon