Picture for Tsung-Yi Ho

Tsung-Yi Ho

Steering Externalities: Benign Activation Steering Unintentionally Increases Jailbreak Risk for Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

Hey, That's My Data! Label-Only Dataset Inference in Large Language Models

Add code
Jun 06, 2025
Viaarxiv icon

Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models

Add code
Jun 06, 2025
Viaarxiv icon

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Add code
Jun 05, 2025
Viaarxiv icon

Unsupervised Out-of-Distribution Detection in Medical Imaging Using Multi-Exit Class Activation Maps and Feature Masking

Add code
May 13, 2025
Viaarxiv icon

Retention Score: Quantifying Jailbreak Risks for Vision Language Models

Add code
Dec 23, 2024
Viaarxiv icon

Echo: Simulating Distributed Training At Scale

Add code
Dec 17, 2024
Figure 1 for Echo: Simulating Distributed Training At Scale
Figure 2 for Echo: Simulating Distributed Training At Scale
Figure 3 for Echo: Simulating Distributed Training At Scale
Figure 4 for Echo: Simulating Distributed Training At Scale
Viaarxiv icon

Defining and Evaluating Physical Safety for Large Language Models

Add code
Nov 04, 2024
Figure 1 for Defining and Evaluating Physical Safety for Large Language Models
Figure 2 for Defining and Evaluating Physical Safety for Large Language Models
Figure 3 for Defining and Evaluating Physical Safety for Large Language Models
Figure 4 for Defining and Evaluating Physical Safety for Large Language Models
Viaarxiv icon

Trustworthiness in Retrieval-Augmented Generation Systems: A Survey

Add code
Sep 16, 2024
Figure 1 for Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Figure 2 for Trustworthiness in Retrieval-Augmented Generation Systems: A Survey
Viaarxiv icon

When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective

Add code
Sep 04, 2024
Figure 1 for When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective
Figure 2 for When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective
Figure 3 for When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective
Figure 4 for When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective
Viaarxiv icon