Picture for Tsung-Yi Ho

Tsung-Yi Ho

Defining and Evaluating Physical Safety for Large Language Models

Add code
Nov 04, 2024
Viaarxiv icon

Trustworthiness in Retrieval-Augmented Generation Systems: A Survey

Add code
Sep 16, 2024
Viaarxiv icon

When Does Visual Prompting Outperform Linear Probing for Vision-Language Models? A Likelihood Perspective

Add code
Sep 04, 2024
Viaarxiv icon

The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models

Add code
Jun 14, 2024
Viaarxiv icon

RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection

Add code
May 30, 2024
Viaarxiv icon

Achieving Fairness Through Channel Pruning for Dermatological Disease Diagnosis

Add code
May 14, 2024
Viaarxiv icon

TroLLoc: Logic Locking and Layout Hardening for IC Security Closure against Hardware Trojans

Add code
May 09, 2024
Viaarxiv icon

NaNa and MiGu: Semantic Data Augmentation Techniques to Enhance Protein Classification in Graph Neural Networks

Add code
Mar 26, 2024
Viaarxiv icon

Evaluating Text-to-Image Generative Models: An Empirical Study on Human Image Synthesis

Add code
Mar 08, 2024
Viaarxiv icon

Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes

Add code
Mar 05, 2024
Viaarxiv icon