Picture for Shaoshuai Shi

Shaoshuai Shi

AMP: Autoregressive Motion Prediction Revisited with Next Token Prediction for Autonomous Driving

Add code
Mar 21, 2024
Viaarxiv icon

GiT: Towards Generalist Vision Transformer through Universal Language Interface

Add code
Mar 14, 2024
Figure 1 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 2 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 3 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Figure 4 for GiT: Towards Generalist Vision Transformer through Universal Language Interface
Viaarxiv icon

UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation

Add code
Aug 15, 2023
Viaarxiv icon

MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying

Add code
Jun 30, 2023
Viaarxiv icon

TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses

Add code
Jun 09, 2023
Figure 1 for TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses
Figure 2 for TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses
Figure 3 for TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses
Figure 4 for TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses
Viaarxiv icon

Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding

Add code
May 08, 2023
Viaarxiv icon

Sparse Dense Fusion for 3D Object Detection

Add code
Apr 09, 2023
Viaarxiv icon

Virtual Sparse Convolution for Multimodal 3D Object Detection

Add code
Mar 04, 2023
Viaarxiv icon

DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets

Add code
Jan 15, 2023
Viaarxiv icon

ConQueR: Query Contrast Voxel-DETR for 3D Object Detection

Add code
Dec 14, 2022
Viaarxiv icon