Picture for Henghui Ding

Henghui Ding

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

Add code
Nov 04, 2024
Viaarxiv icon

Transferable Adversarial Attacks on SAM and Its Downstream Models

Add code
Oct 29, 2024
Viaarxiv icon

How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?

Add code
Oct 23, 2024
Viaarxiv icon

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Add code
Sep 09, 2024
Viaarxiv icon

3D-GRES: Generalized 3D Referring Expression Segmentation

Add code
Jul 31, 2024
Viaarxiv icon

RefMask3D: Language-Guided Transformer for 3D Referring Segmentation

Add code
Jul 25, 2024
Viaarxiv icon

SegPoint: Segment Any Point Cloud via Large Language Model

Add code
Jul 18, 2024
Viaarxiv icon

PECTP: Parameter-Efficient Cross-Task Prompts for Incremental Vision Transformer

Add code
Jul 04, 2024
Viaarxiv icon

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Add code
Jun 24, 2024
Figure 1 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 2 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 3 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Figure 4 for PVUW 2024 Challenge on Complex Video Understanding: Methods and Results
Viaarxiv icon

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

Add code
Jun 20, 2024
Viaarxiv icon