Hanming Deng

M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving

Mar 19, 2024
Via arXiv

DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving

Dec 25, 2023
Via arXiv

Scene as Occupancy

Jun 06, 2023
Via arXiv

3D Data Augmentation for Driving Scenes on Camera

Mar 18, 2023
Via arXiv

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Sep 12, 2022
Via arXiv

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

Sep 07, 2021
Via arXiv

Decoupled Spatial-Temporal Transformer for Video Inpainting

Apr 14, 2021
Via arXiv

1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask

Sep 03, 2020
Via arXiv