Picture for Jindong Gu

Jindong Gu

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

Add code
Mar 14, 2025
Viaarxiv icon

Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation

Add code
Mar 10, 2025
Viaarxiv icon

Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack

Add code
Feb 27, 2025
Viaarxiv icon

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

Add code
Feb 22, 2025
Viaarxiv icon

Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak

Add code
Jan 23, 2025
Viaarxiv icon

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

Add code
Jan 11, 2025
Figure 1 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 2 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 3 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 4 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Viaarxiv icon

Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs

Add code
Jan 11, 2025
Viaarxiv icon

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation

Add code
Dec 13, 2024
Viaarxiv icon

Uncovering Vision Modality Threats in Image-to-Image Tasks

Add code
Dec 07, 2024
Viaarxiv icon

Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models

Add code
Dec 06, 2024
Viaarxiv icon