Jindong Gu

FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models

Oct 07, 2024
Via arXiv

Visual Question Decomposition on Multimodal Large Language Models

Sep 28, 2024
Via arXiv

Multimodal Pragmatic Jailbreak on Text-to-image Models

Sep 27, 2024
Via arXiv

RT-Attack: Jailbreaking Text-to-Image Models via Random Token

Aug 27, 2024
Via arXiv

Can Editing LLMs Inject Harm?

Jul 29, 2024
Via arXiv

Dataset Distillation by Automatic Training Trajectories

Jul 19, 2024
Via arXiv

MMRo: Are Multimodal LLMs Eligible as the Brain for In-Home Robotics?

Jun 28, 2024
Via arXiv

Localizing Events in Videos with Multimodal Queries

Jun 14, 2024
Via arXiv

Provably Better Explanations with Optimized Aggregation of Feature Attributions

Jun 07, 2024
Via arXiv

Learning Visual Prompts for Guiding the Attention of Vision Transformers

Jun 05, 2024
Via arXiv