Picture for Guanbin Li

Guanbin Li

Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal

Add code
Apr 07, 2025
Viaarxiv icon

Empowering Large Language Models with 3D Situation Awareness

Add code
Mar 29, 2025
Viaarxiv icon

DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode

Add code
Mar 17, 2025
Viaarxiv icon

VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction

Add code
Mar 15, 2025
Viaarxiv icon

Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering

Add code
Mar 14, 2025
Viaarxiv icon

Aerial Vision-and-Language Navigation with Grid-based View Selection and Map Construction

Add code
Mar 14, 2025
Viaarxiv icon

EgoSplat: Open-Vocabulary Egocentric Scene Understanding with Language Embedded 3D Gaussian Splatting

Add code
Mar 14, 2025
Viaarxiv icon

DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

Add code
Mar 06, 2025
Viaarxiv icon

Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration Decoding

Add code
Jan 03, 2025
Viaarxiv icon

MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation

Add code
Dec 16, 2024
Figure 1 for MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation
Figure 2 for MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation
Figure 3 for MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation
Figure 4 for MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation
Viaarxiv icon