Picture for Minzhou Pan

Minzhou Pan

SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations

Add code
Dec 09, 2024
Viaarxiv icon

AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies

Add code
Jun 25, 2024
Viaarxiv icon

Evaluating and Mitigating IP Infringement in Visual Generative AI

Add code
Jun 07, 2024
Viaarxiv icon

JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits

Add code
Jun 06, 2024
Figure 1 for JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits
Figure 2 for JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits
Figure 3 for JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits
Figure 4 for JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits
Viaarxiv icon

Finding needles in a haystack: A Black-Box Approach to Invisible Watermark Detection

Add code
Mar 30, 2024
Viaarxiv icon

ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms

Add code
Feb 22, 2023
Viaarxiv icon

How to Sift Out a Clean Data Subset in the Presence of Data Poisoning?

Add code
Oct 12, 2022
Figure 1 for How to Sift Out a Clean Data Subset in the Presence of Data Poisoning?
Figure 2 for How to Sift Out a Clean Data Subset in the Presence of Data Poisoning?
Figure 3 for How to Sift Out a Clean Data Subset in the Presence of Data Poisoning?
Figure 4 for How to Sift Out a Clean Data Subset in the Presence of Data Poisoning?
Viaarxiv icon

Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information

Add code
Apr 15, 2022
Figure 1 for Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information
Figure 2 for Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information
Figure 3 for Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information
Figure 4 for Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information
Viaarxiv icon