Di Huang

Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learning

Jan 12, 2025
Via arXiv

Generating Editable Head Avatars with 3D Gaussian GANs

Dec 26, 2024
Via arXiv

GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning

Dec 20, 2024
Via arXiv

3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling

Dec 16, 2024
Via arXiv

World-Consistent Data Generation for Vision-and-Language Navigation

Dec 09, 2024
Via arXiv

DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild

Nov 20, 2024
Via arXiv

Constraint Learning for Parametric Point Cloud

Nov 12, 2024
Via arXiv

Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery

Nov 05, 2024
Via arXiv

EMMA: End-to-End Multimodal Model for Autonomous Driving

Oct 30, 2024
Via arXiv

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Oct 29, 2024
Via arXiv