Jindong Gu

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Apr 02, 2025
Via arXiv

ShieldGemma 2: Robust and Tractable Image Content Moderation

Apr 01, 2025
Via arXiv

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

Mar 14, 2025
Via arXiv

Magnet: Multi-turn Tool-use Data Synthesis and Distillation via Graph Translation

Mar 10, 2025
Via arXiv

Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack

Feb 27, 2025
Via arXiv

PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving

Feb 22, 2025
Via arXiv

Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak

Jan 23, 2025
Via arXiv

Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs

Jan 11, 2025
Via arXiv

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

Jan 11, 2025
Via arXiv

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation

Dec 13, 2024
Via arXiv