Picture for Yu-Wing Tai

Yu-Wing Tai

Tencent

FusionSegReID: Advancing Person Re-Identification with Multimodal Retrieval and Precise Segmentation

Add code
Mar 27, 2025
Viaarxiv icon

Multimodal Generation of Animatable 3D Human Models with AvatarForge

Add code
Mar 11, 2025
Viaarxiv icon

ReelWave: A Multi-Agent Framework Toward Professional Movie Sound Generation

Add code
Mar 10, 2025
Viaarxiv icon

Dynamic Path Navigation for Motion Agents with LLM Reasoning

Add code
Mar 10, 2025
Viaarxiv icon

Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts

Add code
Mar 10, 2025
Viaarxiv icon

WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents

Add code
Feb 21, 2025
Viaarxiv icon

UVRM: A Scalable 3D Reconstruction Model from Unposed Videos

Add code
Jan 16, 2025
Figure 1 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Figure 2 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Figure 3 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Figure 4 for UVRM: A Scalable 3D Reconstruction Model from Unposed Videos
Viaarxiv icon

Audio-Agent: Leveraging LLMs For Audio Generation, Editing and Composition

Add code
Oct 04, 2024
Viaarxiv icon

Reward-RAG: Enhancing RAG with Reward Driven Supervision

Add code
Oct 03, 2024
Figure 1 for Reward-RAG: Enhancing RAG with Reward Driven Supervision
Figure 2 for Reward-RAG: Enhancing RAG with Reward Driven Supervision
Figure 3 for Reward-RAG: Enhancing RAG with Reward Driven Supervision
Figure 4 for Reward-RAG: Enhancing RAG with Reward Driven Supervision
Viaarxiv icon

ChatCam: Empowering Camera Control through Conversational AI

Add code
Sep 25, 2024
Viaarxiv icon