Zhuoling Li

VIRT: Vision Instructed Transformer for Robotic Manipulation

Oct 09, 2024
Via arXiv

TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers

Aug 25, 2024
Via arXiv

LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence

May 27, 2024
Via arXiv

DisC-GS: Discontinuity-aware Gaussian Splatting

May 24, 2024
Via arXiv

UniMODE: Unified Monocular 3D Object Detection

Feb 28, 2024
Via arXiv

GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping

Jul 18, 2023
Via arXiv

The 1st-place Solution for CVPR 2023 OpenLane Topology in Autonomous Driving Challenge

Jun 16, 2023
Via arXiv

MOTRv3: Release-Fetch Supervision for End-to-End Multi-Object Tracking

May 23, 2023
Via arXiv

VoxelFormer: Bird's-Eye-View Feature Generation based on Dual-view Attention for Multi-view 3D Object Detection

Apr 03, 2023
Via arXiv

Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation

Dec 03, 2022
Via arXiv