Picture for Zhen Han

Zhen Han

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

Add code
Jan 07, 2025
Figure 1 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Figure 2 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Figure 3 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Figure 4 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Viaarxiv icon

HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

Add code
Dec 20, 2024
Viaarxiv icon

PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model

Add code
Nov 12, 2024
Figure 1 for PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
Figure 2 for PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
Figure 3 for PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
Figure 4 for PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
Viaarxiv icon

Visual Question Decomposition on Multimodal Large Language Models

Add code
Sep 28, 2024
Figure 1 for Visual Question Decomposition on Multimodal Large Language Models
Figure 2 for Visual Question Decomposition on Multimodal Large Language Models
Figure 3 for Visual Question Decomposition on Multimodal Large Language Models
Figure 4 for Visual Question Decomposition on Multimodal Large Language Models
Viaarxiv icon

WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration

Add code
Aug 28, 2024
Viaarxiv icon

IDRetracor: Towards Visual Forensics Against Malicious Face Swapping

Add code
Aug 13, 2024
Figure 1 for IDRetracor: Towards Visual Forensics Against Malicious Face Swapping
Figure 2 for IDRetracor: Towards Visual Forensics Against Malicious Face Swapping
Figure 3 for IDRetracor: Towards Visual Forensics Against Malicious Face Swapping
Figure 4 for IDRetracor: Towards Visual Forensics Against Malicious Face Swapping
Viaarxiv icon

A rapid approach to urban traffic noise mapping with a generative adversarial network

Add code
May 21, 2024
Viaarxiv icon

StyleBooth: Image Style Editing with Multimodal Instruction

Add code
Apr 18, 2024
Viaarxiv icon

Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?

Add code
Apr 04, 2024
Figure 1 for Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
Figure 2 for Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
Figure 3 for Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
Figure 4 for Red Teaming GPT-4V: Are GPT-4V Safe Against Uni/Multi-Modal Jailbreak Attacks?
Viaarxiv icon

Locate, Assign, Refine: Taming Customized Image Inpainting with Text-Subject Guidance

Add code
Mar 28, 2024
Viaarxiv icon