Picture for Fan Lu

Fan Lu

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Add code
Dec 12, 2024
Viaarxiv icon

Learning Visual Generative Priors without Text

Add code
Dec 10, 2024
Viaarxiv icon

GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos

Add code
Dec 03, 2024
Viaarxiv icon

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Add code
Oct 07, 2024
Figure 1 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 2 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 3 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 4 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Viaarxiv icon

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

Add code
Jul 08, 2024
Viaarxiv icon

RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling

Add code
May 27, 2024
Viaarxiv icon

Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior

Add code
Apr 10, 2024
Viaarxiv icon

LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Add code
Apr 03, 2024
Viaarxiv icon

DreamLIP: Language-Image Pre-training with Long Captions

Add code
Mar 25, 2024
Viaarxiv icon

PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds

Add code
Feb 29, 2024
Viaarxiv icon