Picture for Dingyuan Zhang

Dingyuan Zhang

ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

Add code
Mar 25, 2025
Viaarxiv icon

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception

Add code
Mar 17, 2025
Viaarxiv icon

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Add code
Jan 24, 2025
Figure 1 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Figure 2 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Figure 3 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Figure 4 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Viaarxiv icon

Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

Add code
Sep 01, 2024
Figure 1 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Figure 2 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Figure 3 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Figure 4 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Viaarxiv icon

AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Point Cloud Analysis

Add code
Feb 27, 2024
Viaarxiv icon

You Only Look Bottom-Up for Monocular 3D Object Detection

Add code
Jan 27, 2024
Viaarxiv icon

SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model

Add code
Jun 04, 2023
Viaarxiv icon