Picture for Siyuan Huang

Siyuan Huang

SciFig: Towards Automating Scientific Figure Generation

Add code
Jan 07, 2026
Viaarxiv icon

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Add code
Dec 30, 2025
Viaarxiv icon

UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots

Add code
Dec 30, 2025
Viaarxiv icon

3D Scene Change Modeling With Consistent Multi-View Aggregation

Add code
Dec 28, 2025
Viaarxiv icon

Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation

Add code
Dec 10, 2025
Viaarxiv icon

VideoSSR: Video Self-Supervised Reinforcement Learning

Add code
Nov 09, 2025
Viaarxiv icon

Learning Human-Humanoid Coordination for Collaborative Object Carrying

Add code
Oct 16, 2025
Viaarxiv icon

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation

Add code
Aug 25, 2025
Figure 1 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 2 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 3 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 4 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Viaarxiv icon

Spatial-Temporal Multi-Scale Quantization for Flexible Motion Generation

Add code
Aug 12, 2025
Viaarxiv icon

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Add code
Aug 07, 2025
Figure 1 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 2 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 3 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 4 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Viaarxiv icon