Picture for Shiyi Lan

Shiyi Lan

Exploring Camera Encoder Designs for Autonomous Driving Perception

Add code
Jul 09, 2024
Viaarxiv icon

Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint

Add code
Jun 17, 2024
Viaarxiv icon

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Add code
Jun 11, 2024
Viaarxiv icon

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

Add code
May 02, 2024
Viaarxiv icon

What is Point Supervision Worth in Video Instance Segmentation?

Add code
Apr 01, 2024
Viaarxiv icon

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks

Add code
Jan 31, 2024
Viaarxiv icon

Fully Attentional Networks with Self-emerging Token Labeling

Add code
Jan 08, 2024
Viaarxiv icon

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Add code
Dec 21, 2023
Figure 1 for A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Figure 2 for A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Figure 3 for A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Figure 4 for A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties
Viaarxiv icon

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

Add code
Dec 05, 2023
Viaarxiv icon

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

Add code
Dec 04, 2023
Viaarxiv icon