Picture for Hongyuan Zhu

Hongyuan Zhu

Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark

Add code
Dec 03, 2024
Viaarxiv icon

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image

Add code
Oct 20, 2024
Viaarxiv icon

PointCloud-Text Matching: Benchmark Datasets and a Baseline

Add code
Mar 28, 2024
Figure 1 for PointCloud-Text Matching: Benchmark Datasets and a Baseline
Figure 2 for PointCloud-Text Matching: Benchmark Datasets and a Baseline
Figure 3 for PointCloud-Text Matching: Benchmark Datasets and a Baseline
Figure 4 for PointCloud-Text Matching: Benchmark Datasets and a Baseline
Viaarxiv icon

Contributing Dimension Structure of Deep Feature for Coreset Selection

Add code
Jan 29, 2024
Viaarxiv icon

Direct Distillation between Different Domains

Add code
Jan 12, 2024
Viaarxiv icon

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

Add code
Dec 17, 2023
Figure 1 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Figure 2 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Figure 3 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Figure 4 for M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
Viaarxiv icon

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Add code
Nov 30, 2023
Figure 1 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 2 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 3 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 4 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Viaarxiv icon

Exploit the antenna response consistency to define the alignment criteria for CSI data

Add code
Oct 10, 2023
Viaarxiv icon

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Add code
Sep 17, 2023
Viaarxiv icon

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Add code
Sep 06, 2023
Viaarxiv icon