
Henghui Ding

Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions

Jan 03, 2025

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension

Jan 02, 2025

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Dec 17, 2024

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

Nov 04, 2024

Transferable Adversarial Attacks on SAM and Its Downstream Models

Oct 29, 2024

How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?

Oct 23, 2024

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Sep 09, 2024

3D-GRES: Generalized 3D Referring Expression Segmentation

Jul 31, 2024

RefMask3D: Language-Guided Transformer for 3D Referring Segmentation

Jul 25, 2024

SegPoint: Segment Any Point Cloud via Large Language Model

Jul 18, 2024