Picture for Di Huang

Di Huang

World-Consistent Data Generation for Vision-and-Language Navigation

Add code
Dec 09, 2024
Viaarxiv icon

DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild

Add code
Nov 20, 2024
Figure 1 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Figure 2 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Figure 3 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Figure 4 for DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild
Viaarxiv icon

Constraint Learning for Parametric Point Cloud

Add code
Nov 12, 2024
Viaarxiv icon

Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery

Add code
Nov 05, 2024
Figure 1 for Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery
Figure 2 for Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery
Figure 3 for Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery
Figure 4 for Centerness-based Instance-aware Knowledge Distillation with Task-wise Mutual Lifting for Object Detection on Drone Imagery
Viaarxiv icon

EMMA: End-to-End Multimodal Model for Autonomous Driving

Add code
Oct 30, 2024
Figure 1 for EMMA: End-to-End Multimodal Model for Autonomous Driving
Figure 2 for EMMA: End-to-End Multimodal Model for Autonomous Driving
Figure 3 for EMMA: End-to-End Multimodal Model for Autonomous Driving
Figure 4 for EMMA: End-to-End Multimodal Model for Autonomous Driving
Viaarxiv icon

MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding

Add code
Oct 29, 2024
Figure 1 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 2 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 3 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Figure 4 for MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding
Viaarxiv icon

Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction

Add code
Oct 24, 2024
Figure 1 for Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Figure 2 for Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Figure 3 for Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Figure 4 for Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction
Viaarxiv icon

FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model

Add code
Oct 17, 2024
Figure 1 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 2 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 3 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Figure 4 for FiTv2: Scalable and Improved Flexible Vision Transformer for Diffusion Model
Viaarxiv icon

Depth Any Video with Scalable Synthetic Data

Add code
Oct 14, 2024
Figure 1 for Depth Any Video with Scalable Synthetic Data
Figure 2 for Depth Any Video with Scalable Synthetic Data
Figure 3 for Depth Any Video with Scalable Synthetic Data
Figure 4 for Depth Any Video with Scalable Synthetic Data
Viaarxiv icon

HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations

Add code
Sep 28, 2024
Figure 1 for HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations
Figure 2 for HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations
Figure 3 for HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations
Figure 4 for HealthQ: Unveiling Questioning Capabilities of LLM Chains in Healthcare Conversations
Viaarxiv icon