Shuirong Cao

BAMBA: A Bimodal Adversarial Multi-Round Black-Box Jailbreak Attacker for LVLMs

Dec 08, 2024
Via arXiv

Gibberish is All You Need for Membership Inference Detection in Contrastive Language-Audio Pretraining

Nov 02, 2024
Via arXiv

A Unimodal Speaker-Level Membership Inference Detector for Contrastive Pretraining

Oct 24, 2024
Via arXiv

AGR: Age Group fairness Reward for Bias Mitigation in LLMs

Sep 06, 2024
Via arXiv

RLRF: Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs

Apr 28, 2024
Via arXiv

Deceiving to Enlighten: Coaxing LLMs to Self-Reflection for Enhanced Bias Detection and Mitigation

Apr 15, 2024
Via arXiv

Revisiting the Role of Similarity and Dissimilarity in Best Counter Argument Retrieval

Apr 19, 2023
Via arXiv