Picture for Jizhong Han

Jizhong Han

Anchor3DLane++: 3D Lane Detection via Sample-Adaptive Sparse 3D Anchor Regression

Add code
Dec 22, 2024
Viaarxiv icon

Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation

Add code
Dec 12, 2024
Viaarxiv icon

The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models

Add code
Nov 18, 2024
Viaarxiv icon

FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction

Add code
Sep 26, 2024
Viaarxiv icon

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding

Add code
Sep 12, 2024
Figure 1 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 2 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 3 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 4 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Viaarxiv icon

AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs

Add code
Sep 11, 2024
Viaarxiv icon

Learning to Discover Forgery Cues for Face Forgery Detection

Add code
Sep 02, 2024
Figure 1 for Learning to Discover Forgery Cues for Face Forgery Detection
Figure 2 for Learning to Discover Forgery Cues for Face Forgery Detection
Figure 3 for Learning to Discover Forgery Cues for Face Forgery Detection
Figure 4 for Learning to Discover Forgery Cues for Face Forgery Detection
Viaarxiv icon

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Add code
Aug 28, 2024
Figure 1 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 2 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 3 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Figure 4 for Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation
Viaarxiv icon

Unveiling Structural Memorization: Structural Membership Inference Attack for Text-to-Image Diffusion Models

Add code
Jul 18, 2024
Viaarxiv icon

Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens

Add code
Jun 19, 2024
Figure 1 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 2 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 3 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Figure 4 for Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
Viaarxiv icon