Picture for Siyuan Huang

Siyuan Huang

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation

Add code
Aug 25, 2025
Figure 1 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 2 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 3 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 4 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Viaarxiv icon

Spatial-Temporal Multi-Scale Quantization for Flexible Motion Generation

Add code
Aug 12, 2025
Viaarxiv icon

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Add code
Aug 07, 2025
Viaarxiv icon

LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation

Add code
Jun 11, 2025
Figure 1 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Figure 2 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Figure 3 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Figure 4 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Viaarxiv icon

Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF

Add code
Jun 10, 2025
Viaarxiv icon

CLONE: Closed-Loop Whole-Body Humanoid Teleoperation for Long-Horizon Tasks

Add code
Jun 10, 2025
Viaarxiv icon

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Add code
Jun 07, 2025
Viaarxiv icon

InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing

Add code
May 30, 2025
Viaarxiv icon

Learning Unified Force and Position Control for Legged Loco-Manipulation

Add code
May 27, 2025
Viaarxiv icon

Pretraining Language Models to Ponder in Continuous Space

Add code
May 27, 2025
Viaarxiv icon