Picture for Michael Backes

Michael Backes

Captured by Captions: On Memorization and its Mitigation in CLIP Models

Add code
Feb 11, 2025
Viaarxiv icon

Synthetic Artifact Auditing: Tracing LLM-Generated Synthetic Data Usage in Downstream Applications

Add code
Feb 02, 2025
Viaarxiv icon

HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns

Add code
Jan 28, 2025
Viaarxiv icon

DivTrackee versus DynTracker: Promoting Diversity in Anti-Facial Recognition against Dynamic FR Strategy

Add code
Jan 11, 2025
Viaarxiv icon

SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation

Add code
Jan 03, 2025
Figure 1 for SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
Figure 2 for SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
Figure 3 for SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
Figure 4 for SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation
Viaarxiv icon

Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media

Add code
Dec 24, 2024
Viaarxiv icon

Localizing Memorization in SSL Vision Encoders

Add code
Sep 27, 2024
Figure 1 for Localizing Memorization in SSL Vision Encoders
Figure 2 for Localizing Memorization in SSL Vision Encoders
Figure 3 for Localizing Memorization in SSL Vision Encoders
Figure 4 for Localizing Memorization in SSL Vision Encoders
Viaarxiv icon

Understanding Data Importance in Machine Learning Attacks: Does Valuable Data Pose Greater Harm?

Add code
Sep 05, 2024
Viaarxiv icon

Membership Inference Attacks Against In-Context Learning

Add code
Sep 02, 2024
Viaarxiv icon

Image-Perfect Imperfections: Safety, Bias, and Authenticity in the Shadow of Text-To-Image Model Evolution

Add code
Aug 30, 2024
Viaarxiv icon