Picture for Jizhong Han

Jizhong Han

Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation

Add code
Dec 12, 2024
Viaarxiv icon

The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models

Add code
Nov 18, 2024
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding

Add code
Sep 12, 2024
Figure 1 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 2 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 3 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 4 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Viaarxiv icon

AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs

Add code
Sep 11, 2024
Viaarxiv icon

Learning to Discover Forgery Cues for Face Forgery Detection

Add code
Sep 02, 2024
Figure 1 for Learning to Discover Forgery Cues for Face Forgery Detection
Figure 2 for Learning to Discover Forgery Cues for Face Forgery Detection
Figure 3 for Learning to Discover Forgery Cues for Face Forgery Detection
Figure 4 for Learning to Discover Forgery Cues for Face Forgery Detection
Viaarxiv icon

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Add code
Aug 28, 2024
Viaarxiv icon

Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models

Add code
Jul 18, 2024
Viaarxiv icon

Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens

Add code
Jun 19, 2024
Figure 1 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 2 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 3 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 4 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Viaarxiv icon

Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM

Add code
May 09, 2024
Viaarxiv icon