Picture for Xiang Deng

Xiang Deng

Mark

Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

Add code
Mar 13, 2025
Viaarxiv icon

Embodied Crowd Counting

Add code
Mar 11, 2025
Viaarxiv icon

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

Add code
Feb 27, 2025
Viaarxiv icon

Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts

Add code
Oct 31, 2024
Figure 1 for Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Figure 2 for Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Figure 3 for Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Figure 4 for Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts
Viaarxiv icon

EPD: Long-term Memory Extraction, Context-awared Planning and Multi-iteration Decision @ EgoPlan Challenge ICML 2024

Add code
Jul 28, 2024
Viaarxiv icon

Hardware Neural Control of CartPole and F1TENTH Race Car

Add code
Jul 11, 2024
Figure 1 for Hardware Neural Control of CartPole and F1TENTH Race Car
Figure 2 for Hardware Neural Control of CartPole and F1TENTH Race Car
Figure 3 for Hardware Neural Control of CartPole and F1TENTH Race Car
Figure 4 for Hardware Neural Control of CartPole and F1TENTH Race Car
Viaarxiv icon

Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL

Add code
Jun 08, 2024
Viaarxiv icon

RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models

Add code
Apr 07, 2024
Viaarxiv icon

Dual-View Visual Contextualization for Web Navigation

Add code
Feb 06, 2024
Figure 1 for Dual-View Visual Contextualization for Web Navigation
Figure 2 for Dual-View Visual Contextualization for Web Navigation
Figure 3 for Dual-View Visual Contextualization for Web Navigation
Figure 4 for Dual-View Visual Contextualization for Web Navigation
Viaarxiv icon

GMTalker: Gaussian Mixture based Emotional talking video Portraits

Add code
Dec 12, 2023
Viaarxiv icon