Ziying Song

FGU3R: Fine-Grained Fusion via Unified 3D Representation for Multimodal 3D Object Detection

Jan 08, 2025

TiGDistill-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning Distillation

Dec 30, 2024

GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving

Nov 19, 2024

V2X-Radar: A Multi-modal Dataset with 4D Radar for Cooperative Perception

Nov 17, 2024

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Oct 07, 2024

CWT-Net: Super-resolution of Histopathology Images Using a Cross-scale Wavelet-based Transformer

Sep 11, 2024

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection

Jun 16, 2024

ContrastAlign: Toward Robust BEV Feature Alignment via Contrastive Learning for Multi-Modal 3D Object Detection

May 27, 2024

M2DA: Multi-Modal Fusion Transformer Incorporating Driver Attention for Autonomous Driving

Mar 19, 2024

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

Mar 18, 2024