Picture for Dan Xu

Dan Xu

F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting

Add code
Jan 12, 2025
Viaarxiv icon

Free-viewpoint Human Animation with Pose-correlated Reference Selection

Add code
Dec 23, 2024
Viaarxiv icon

Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation

Add code
Dec 01, 2024
Figure 1 for Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
Figure 2 for Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
Figure 3 for Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
Figure 4 for Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
Viaarxiv icon

Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions

Add code
Nov 27, 2024
Viaarxiv icon

LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes

Add code
Nov 18, 2024
Viaarxiv icon

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Add code
Oct 29, 2024
Figure 1 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 2 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 3 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 4 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Viaarxiv icon

MM-Ego: Towards Building Egocentric Multimodal LLMs

Add code
Oct 09, 2024
Figure 1 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 2 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 3 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 4 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Viaarxiv icon

3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Add code
Oct 02, 2024
Viaarxiv icon

DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis

Add code
Sep 16, 2024
Figure 1 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Figure 2 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Figure 3 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Figure 4 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Viaarxiv icon

Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling

Add code
Jul 16, 2024
Viaarxiv icon