Picture for Atoosa Chegini

Atoosa Chegini

SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF

Add code
Nov 04, 2024
Figure 1 for SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF
Figure 2 for SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF
Figure 3 for SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF
Figure 4 for SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF
Viaarxiv icon

What do we learn from inverting CLIP models?

Add code
Mar 05, 2024
Figure 1 for What do we learn from inverting CLIP models?
Figure 2 for What do we learn from inverting CLIP models?
Figure 3 for What do we learn from inverting CLIP models?
Figure 4 for What do we learn from inverting CLIP models?
Viaarxiv icon

Fast Adversarial Attacks on Language Models In One GPU Minute

Add code
Feb 23, 2024
Figure 1 for Fast Adversarial Attacks on Language Models In One GPU Minute
Figure 2 for Fast Adversarial Attacks on Language Models In One GPU Minute
Figure 3 for Fast Adversarial Attacks on Language Models In One GPU Minute
Figure 4 for Fast Adversarial Attacks on Language Models In One GPU Minute
Viaarxiv icon

Identifying and Mitigating Model Failures through Few-shot CLIP-aided Diffusion Generation

Add code
Dec 09, 2023
Viaarxiv icon

Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks

Add code
Sep 29, 2023
Viaarxiv icon

Run-Off Election: Improved Provable Defense against Data Poisoning Attacks

Add code
Feb 05, 2023
Figure 1 for Run-Off Election: Improved Provable Defense against Data Poisoning Attacks
Figure 2 for Run-Off Election: Improved Provable Defense against Data Poisoning Attacks
Figure 3 for Run-Off Election: Improved Provable Defense against Data Poisoning Attacks
Figure 4 for Run-Off Election: Improved Provable Defense against Data Poisoning Attacks
Viaarxiv icon