Picture for Ruoxi Cheng

Ruoxi Cheng

Southeast University

BAMBA: A Bimodal Adversarial Multi-Round Black-Box Jailbreak Attacker for LVLMs

Add code
Dec 08, 2024
Figure 1 for BAMBA: A Bimodal Adversarial Multi-Round Black-Box Jailbreak Attacker for LVLMs
Figure 2 for BAMBA: A Bimodal Adversarial Multi-Round Black-Box Jailbreak Attacker for LVLMs
Figure 3 for BAMBA: A Bimodal Adversarial Multi-Round Black-Box Jailbreak Attacker for LVLMs
Figure 4 for BAMBA: A Bimodal Adversarial Multi-Round Black-Box Jailbreak Attacker for LVLMs
Viaarxiv icon

SelfPrompt: Autonomously Evaluating LLM Robustness via Domain-Constrained Knowledge Guidelines and Refined Adversarial Prompts

Add code
Dec 01, 2024
Viaarxiv icon

Gibberish is All You Need for Membership Inference Detection in Contrastive Language-Audio Pretraining

Add code
Nov 02, 2024
Viaarxiv icon

A Unimodal Speaker-Level Membership Inference Detector for Contrastive Pretraining

Add code
Oct 24, 2024
Viaarxiv icon

AGR: Age Group fairness Reward for Bias Mitigation in LLMs

Add code
Sep 06, 2024
Figure 1 for AGR: Age Group fairness Reward for Bias Mitigation in LLMs
Figure 2 for AGR: Age Group fairness Reward for Bias Mitigation in LLMs
Figure 3 for AGR: Age Group fairness Reward for Bias Mitigation in LLMs
Figure 4 for AGR: Age Group fairness Reward for Bias Mitigation in LLMs
Viaarxiv icon

KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs

Add code
Jun 16, 2024
Viaarxiv icon

Identity Inference from CLIP Models using Only Textual Data

Add code
May 23, 2024
Viaarxiv icon

RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs

Add code
Apr 28, 2024
Figure 1 for RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs
Figure 2 for RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs
Figure 3 for RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs
Figure 4 for RLRF:Reinforcement Learning from Reflection through Debates as Feedback for Bias Mitigation in LLMs
Viaarxiv icon

Deceiving to Enlighten: Coaxing LLMs to Self-Reflection for Enhanced Bias Detection and Mitigation

Add code
Apr 15, 2024
Figure 1 for Deceiving to Enlighten: Coaxing LLMs to Self-Reflection for Enhanced Bias Detection and Mitigation
Figure 2 for Deceiving to Enlighten: Coaxing LLMs to Self-Reflection for Enhanced Bias Detection and Mitigation
Figure 3 for Deceiving to Enlighten: Coaxing LLMs to Self-Reflection for Enhanced Bias Detection and Mitigation
Figure 4 for Deceiving to Enlighten: Coaxing LLMs to Self-Reflection for Enhanced Bias Detection and Mitigation
Viaarxiv icon