Picture for Chi-Keung Tang

Chi-Keung Tang

HKUST

Multimodal Generation of Animatable 3D Human Models with AvatarForge

Add code
Mar 11, 2025
Viaarxiv icon

Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts

Add code
Mar 10, 2025
Viaarxiv icon

Dynamic Path Navigation for Motion Agents with LLM Reasoning

Add code
Mar 10, 2025
Viaarxiv icon

ReelWave: A Multi-Agent Framework Toward Professional Movie Sound Generation

Add code
Mar 10, 2025
Viaarxiv icon

WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents

Add code
Feb 21, 2025
Viaarxiv icon

UVRM: A Scalable 3D Reconstruction Model from Unposed Videos

Add code
Jan 16, 2025
Figure 1 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Figure 2 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Figure 3 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Figure 4 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Viaarxiv icon

Audio-Agent: Leveraging LLMs For Audio Generation, Editing and Composition

Add code
Oct 04, 2024
Viaarxiv icon

ChatCam: Empowering Camera Control through Conversational AI

Add code
Sep 25, 2024
Viaarxiv icon

VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification

Add code
Jun 08, 2024
Figure 1 for VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Figure 2 for VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Figure 3 for VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Figure 4 for VP-LLM: Text-Driven 3D Volume Completion with Large Language Models through Patchification
Viaarxiv icon

Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling

Add code
Jun 06, 2024
Viaarxiv icon