Chenbin Pan

CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow

Mar 13, 2024
Via arXiv

VLP: Vision Language Planning for Autonomous Driving

Jan 14, 2024
Via arXiv

SVT: Supertoken Video Transformer for Efficient Video Understanding

Apr 23, 2023
Via arXiv

EgoViT: Pyramid Video Transformer for Egocentric Action Recognition

Mar 15, 2023
Via arXiv