Zhiheng Li

Super4DR: 4D Radar-centric Self-supervised Odometry and Gaussian-based Map Optimization

Dec 10, 2025
Via arXiv

Towards 3D Object-Centric Feature Learning for Semantic Scene Completion

Nov 18, 2025
Via arXiv

LlamaSeg: Image Segmentation via Autoregressive Mask Generation

May 26, 2025
Via arXiv

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Mar 19, 2025
Via arXiv

CAO-RONet: A Robust 4D Radar Odometry with Exploring More Information from Low-Quality Points

Mar 03, 2025
Via arXiv

4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation

Jan 06, 2025
Via arXiv

Efficient Scaling of Diffusion Transformers for Text-to-Image Generation

Dec 16, 2024
Via arXiv

LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba

Dec 11, 2024
Via arXiv

Will Large Language Models be a Panacea to Autonomous Driving?

Sep 24, 2024
Via arXiv

Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis

Sep 13, 2024
Via arXiv