Picture for Ziang Li

Ziang Li

ManiSplat: Manipulation Trajectory Synthesis from Monocular Video via Decoupled 3D Gaussian Splatting

Add code
Jun 09, 2026
Viaarxiv icon

Light-WAM: Efficient World Action Models with State-Fusion Action Decoding

Add code
Jun 06, 2026
Viaarxiv icon

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models

Add code
Mar 25, 2026
Viaarxiv icon

HoloBrain-0 Technical Report

Add code
Feb 12, 2026
Viaarxiv icon

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Add code
Sep 17, 2025
Figure 1 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 2 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 3 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Figure 4 for MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook
Viaarxiv icon

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations

Add code
Jun 05, 2025
Viaarxiv icon

SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest

Add code
Feb 13, 2025
Figure 1 for SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest
Figure 2 for SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest
Figure 3 for SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest
Figure 4 for SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest
Viaarxiv icon

Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search

Add code
Jul 08, 2024
Figure 1 for Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Figure 2 for Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Figure 3 for Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Figure 4 for Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search
Viaarxiv icon

EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset

Add code
Oct 17, 2023
Figure 1 for EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset
Figure 2 for EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset
Figure 3 for EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset
Figure 4 for EXMODD: An EXplanatory Multimodal Open-Domain Dialogue dataset
Viaarxiv icon

"I am the follower, also the boss": Exploring Different Levels of Autonomy and Machine Forms of Guiding Robots for the Visually Impaired

Add code
Feb 07, 2023
Figure 1 for "I am the follower, also the boss": Exploring Different Levels of Autonomy and Machine Forms of Guiding Robots for the Visually Impaired
Figure 2 for "I am the follower, also the boss": Exploring Different Levels of Autonomy and Machine Forms of Guiding Robots for the Visually Impaired
Figure 3 for "I am the follower, also the boss": Exploring Different Levels of Autonomy and Machine Forms of Guiding Robots for the Visually Impaired
Figure 4 for "I am the follower, also the boss": Exploring Different Levels of Autonomy and Machine Forms of Guiding Robots for the Visually Impaired
Viaarxiv icon