Shiyi Lan

StreamChat: Chatting with Streaming Video

Dec 11, 2024

Exploring Camera Encoder Designs for Autonomous Driving Perception

Jul 09, 2024

Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint

Jun 17, 2024

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Jun 11, 2024

OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

May 02, 2024

What is Point Supervision Worth in Video Instance Segmentation?

Apr 01, 2024

EVA-GAN: Enhanced Various Audio Generation via Scalable Generative Adversarial Networks

Jan 31, 2024

Fully Attentional Networks with Self-emerging Token Labeling

Jan 08, 2024

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties

Dec 21, 2023

Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

Dec 05, 2023