Picture for Fan Lu

Fan Lu

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Add code
Dec 12, 2024
Viaarxiv icon

Learning Visual Generative Priors without Text

Add code
Dec 10, 2024
Figure 1 for Learning Visual Generative Priors without Text
Figure 2 for Learning Visual Generative Priors without Text
Figure 3 for Learning Visual Generative Priors without Text
Figure 4 for Learning Visual Generative Priors without Text
Viaarxiv icon

GSGTrack: Gaussian Splatting-Guided Object Pose Tracking from RGB Videos

Add code
Dec 03, 2024
Viaarxiv icon

LoTLIP: Improving Language-Image Pre-training for Long Text Understanding

Add code
Oct 07, 2024
Figure 1 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 2 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 3 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Figure 4 for LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
Viaarxiv icon

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

Add code
Jul 08, 2024
Viaarxiv icon

RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling

Add code
May 27, 2024
Viaarxiv icon

Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior

Add code
Apr 10, 2024
Viaarxiv icon

LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Add code
Apr 03, 2024
Viaarxiv icon

DreamLIP: Language-Image Pre-training with Long Captions

Add code
Mar 25, 2024
Viaarxiv icon

PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds

Add code
Feb 29, 2024
Viaarxiv icon