Picture for Zhen Dong

Zhen Dong

SaliencyI2PLoc: saliency-guided image-point cloud localization using contrastive learning

Add code
Dec 20, 2024
Viaarxiv icon

GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting

Add code
Dec 18, 2024
Viaarxiv icon

Align3R: Aligned Monocular Depth Estimation for Dynamic Videos

Add code
Dec 05, 2024
Viaarxiv icon

OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances

Add code
Nov 13, 2024
Figure 1 for OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances
Figure 2 for OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances
Figure 3 for OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances
Figure 4 for OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Geometric and Semantic Guidances
Viaarxiv icon

Reliable-loc: Robust sequential LiDAR global localization in large-scale street scenes based on verifiable cues

Add code
Nov 09, 2024
Viaarxiv icon

Stochastic Communication Avoidance for Recommendation Systems

Add code
Nov 03, 2024
Figure 1 for Stochastic Communication Avoidance for Recommendation Systems
Figure 2 for Stochastic Communication Avoidance for Recommendation Systems
Figure 3 for Stochastic Communication Avoidance for Recommendation Systems
Figure 4 for Stochastic Communication Avoidance for Recommendation Systems
Viaarxiv icon

DQRM: Deep Quantized Recommendation Models

Add code
Oct 26, 2024
Viaarxiv icon

VistaDream: Sampling multiview consistent images for single-view scene reconstruction

Add code
Oct 22, 2024
Viaarxiv icon

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Add code
Oct 10, 2024
Figure 1 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Figure 2 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Figure 3 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Figure 4 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Viaarxiv icon

Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras

Add code
Sep 27, 2024
Figure 1 for Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
Figure 2 for Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
Figure 3 for Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
Figure 4 for Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
Viaarxiv icon