Picture for Zhiqi Li

Zhiqi Li

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Add code
Mar 18, 2025
Viaarxiv icon

VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames

Add code
Mar 13, 2025
Viaarxiv icon

Token-Efficient Long Video Understanding for Multimodal LLMs

Add code
Mar 06, 2025
Viaarxiv icon

ImagineMap: Enhanced HD Map Construction with SD Maps

Add code
Dec 22, 2024
Viaarxiv icon

Driving with InternVL: Oustanding Champion in the Track on Driving with Language of the Autonomous Grand Challenge at CVPR 2024

Add code
Dec 10, 2024
Viaarxiv icon

Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction

Add code
Dec 09, 2024
Viaarxiv icon

Language Models are Symbolic Learners in Arithmetic

Add code
Oct 21, 2024
Viaarxiv icon

DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation

Add code
Oct 09, 2024
Figure 1 for DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation
Figure 2 for DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation
Figure 3 for DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation
Figure 4 for DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation
Viaarxiv icon

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Add code
Jun 11, 2024
Figure 1 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 2 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 3 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Figure 4 for Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation
Viaarxiv icon

Controllable Text-to-3D Generation via Surface-Aligned Gaussian Splatting

Add code
Mar 19, 2024
Viaarxiv icon