Yuping Lin

Unpacking Political Bias in Large Language Models: Insights Across Topic Polarization

Dec 24, 2024

Towards Knowledge Checking in Retrieval-augmented Generation: A Representation Perspective

Nov 21, 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis

Jun 16, 2024

FT-Shield: A Watermark Against Unauthorized Fine-tuning in Text-to-Image Diffusion Models

Oct 03, 2023

Bandlimiting Neural Networks Against Adversarial Attacks

May 30, 2019