Weixin Mao

Multi-GraspLLM: A Multimodal LLM for Multi-Hand Semantic Guided Grasp Generation

Dec 11, 2024
Via arXiv

RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World

Nov 29, 2024
Via arXiv

SegGrasp: Zero-Shot Task-Oriented Grasping via Semantic and Geometric Guided Segmentation

Oct 11, 2024
Via arXiv

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

May 28, 2024
Via arXiv

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

Mar 28, 2024
Via arXiv

PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection

Nov 29, 2023
Via arXiv

ADriver-I: A General World Model for Autonomous Driving

Nov 22, 2023
Via arXiv

GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection

Jun 30, 2023
Via arXiv

Exploring Recurrent Long-term Temporal Fusion for Multi-view 3D Perception

Mar 13, 2023
Via arXiv

Towards 3D Object Detection with 2D Supervision

Nov 15, 2022
Via arXiv