Picture for Hujun Bao

Hujun Bao

Zhejiang University

Acquisition through My Eyes and Steps: A Joint Predictive Agent Model in Egocentric Worlds

Add code
Feb 09, 2025
Viaarxiv icon

XR-VIO: High-precision Visual Inertial Odometry with Fast Initialization for XR Applications

Add code
Feb 03, 2025
Viaarxiv icon

MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training

Add code
Jan 13, 2025
Viaarxiv icon

MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation

Add code
Jan 03, 2025
Viaarxiv icon

GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction

Add code
Dec 19, 2024
Viaarxiv icon

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian

Add code
Dec 19, 2024
Figure 1 for EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
Figure 2 for EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
Figure 3 for EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
Figure 4 for EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
Viaarxiv icon

Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation

Add code
Dec 18, 2024
Viaarxiv icon

StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models

Add code
Dec 17, 2024
Figure 1 for StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
Figure 2 for StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
Figure 3 for StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
Figure 4 for StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
Viaarxiv icon

CFSynthesis: Controllable and Free-view 3D Human Video Synthesis

Add code
Dec 17, 2024
Figure 1 for CFSynthesis: Controllable and Free-view 3D Human Video Synthesis
Figure 2 for CFSynthesis: Controllable and Free-view 3D Human Video Synthesis
Figure 3 for CFSynthesis: Controllable and Free-view 3D Human Video Synthesis
Figure 4 for CFSynthesis: Controllable and Free-view 3D Human Video Synthesis
Viaarxiv icon

Representing Long Volumetric Video with Temporal Gaussian Hierarchy

Add code
Dec 12, 2024
Viaarxiv icon