Guanbin Li

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method

Dec 12, 2024
Via arXiv

DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh

Nov 20, 2024
Via arXiv

GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering

Oct 31, 2024
Via arXiv

Let Video Teaches You More: Video-to-Image Knowledge Distillation using DEtection TRansformer for Medical Video Lesion Detection

Aug 26, 2024
Via arXiv

SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation

Aug 16, 2024
Via arXiv

High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model

Aug 10, 2024
Via arXiv

Style-Preserving Lip Sync via Audio-Aware Style Reference

Aug 10, 2024
Via arXiv

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Jul 31, 2024
Via arXiv

ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation

Jul 23, 2024
Via arXiv

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Jul 15, 2024
Via arXiv