Picture for Dan Xu

Dan Xu

Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation

Add code
Dec 01, 2024
Viaarxiv icon

Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions

Add code
Nov 27, 2024
Viaarxiv icon

LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes

Add code
Nov 18, 2024
Viaarxiv icon

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Add code
Oct 29, 2024
Figure 1 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 2 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 3 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 4 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Viaarxiv icon

MM-Ego: Towards Building Egocentric Multimodal LLMs

Add code
Oct 09, 2024
Figure 1 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 2 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 3 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Figure 4 for MM-Ego: Towards Building Egocentric Multimodal LLMs
Viaarxiv icon

3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection

Add code
Oct 02, 2024
Viaarxiv icon

DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis

Add code
Sep 16, 2024
Figure 1 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Figure 2 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Figure 3 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Figure 4 for DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis
Viaarxiv icon

Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling

Add code
Jul 16, 2024
Viaarxiv icon

Learning Online Scale Transformation for Talking Head Video Generation

Add code
Jul 13, 2024
Viaarxiv icon

Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving

Add code
Jun 18, 2024
Figure 1 for Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving
Figure 2 for Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving
Figure 3 for Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving
Figure 4 for Sample-efficient Imitative Multi-token Decision Transformer for Generalizable Real World Driving
Viaarxiv icon