Songcen Xu

Noah's Ark Lab, Huawei Technologies, Shenzhen, China

Top 10 Open Challenges Steering the Future of Diffusion Language Model and Its Variants

Jan 20, 2026

Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps

Jan 16, 2026

Off The Grid: Detection of Primitives for Feed-Forward 3D Gaussian Splatting

Dec 17, 2025

ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving

Jul 02, 2025

MagicEraser: Erasing Any Objects via Semantics-Aware Control

Oct 14, 2024

TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

Aug 02, 2024

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Aug 01, 2024

GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction

Jul 05, 2024

AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding

Jun 11, 2024

MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections

May 20, 2024