Picture for Muhao Chen

Muhao Chen

ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails

Add code
Feb 19, 2025
Viaarxiv icon

AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection

Add code
Feb 18, 2025
Viaarxiv icon

Unraveling Indirect In-Context Learning Using Influence Functions

Add code
Jan 01, 2025
Figure 1 for Unraveling Indirect In-Context Learning Using Influence Functions
Figure 2 for Unraveling Indirect In-Context Learning Using Influence Functions
Figure 3 for Unraveling Indirect In-Context Learning Using Influence Functions
Figure 4 for Unraveling Indirect In-Context Learning Using Influence Functions
Viaarxiv icon

MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design

Add code
Dec 20, 2024
Figure 1 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 2 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 3 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Figure 4 for MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Viaarxiv icon

SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models

Add code
Dec 06, 2024
Figure 1 for SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models
Figure 2 for SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models
Figure 3 for SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models
Figure 4 for SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models
Viaarxiv icon

Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset

Add code
Nov 05, 2024
Figure 1 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Figure 2 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Figure 3 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Figure 4 for Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
Viaarxiv icon

An Untethered Bioinspired Robotic Tensegrity Dolphin with Multi-Flexibility Design for Aquatic Locomotion

Add code
Nov 01, 2024
Viaarxiv icon

FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks

Add code
Oct 28, 2024
Figure 1 for FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
Figure 2 for FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
Figure 3 for FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
Figure 4 for FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
Viaarxiv icon

SoftSnap: Rapid Prototyping of Untethered Soft Robots Using Snap-Together Modules

Add code
Oct 24, 2024
Viaarxiv icon

SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment

Add code
Oct 18, 2024
Viaarxiv icon