Picture for Zhiheng Li

Zhiheng Li

DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation

Add code
Mar 19, 2025
Viaarxiv icon

CAO-RONet: A Robust 4D Radar Odometry with Exploring More Information from Low-Quality Points

Add code
Mar 03, 2025
Viaarxiv icon

4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation

Add code
Jan 06, 2025
Viaarxiv icon

Efficient Scaling of Diffusion Transformers for Text-to-Image Generation

Add code
Dec 16, 2024
Figure 1 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 2 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 3 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Figure 4 for Efficient Scaling of Diffusion Transformers for Text-to-Image Generation
Viaarxiv icon

LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba

Add code
Dec 11, 2024
Viaarxiv icon

Will Large Language Models be a Panacea to Autonomous Driving?

Add code
Sep 24, 2024
Figure 1 for Will Large Language Models be a Panacea to Autonomous Driving?
Figure 2 for Will Large Language Models be a Panacea to Autonomous Driving?
Figure 3 for Will Large Language Models be a Panacea to Autonomous Driving?
Figure 4 for Will Large Language Models be a Panacea to Autonomous Driving?
Viaarxiv icon

Rhythmic Foley: A Framework For Seamless Audio-Visual Alignment In Video-to-Audio Synthesis

Add code
Sep 13, 2024
Viaarxiv icon

Trajectory Planning for Teleoperated Space Manipulators Using Deep Reinforcement Learning

Add code
Aug 10, 2024
Viaarxiv icon

Integrating Controllable Motion Skills from Demonstrations

Add code
Aug 06, 2024
Figure 1 for Integrating Controllable Motion Skills from Demonstrations
Figure 2 for Integrating Controllable Motion Skills from Demonstrations
Figure 3 for Integrating Controllable Motion Skills from Demonstrations
Figure 4 for Integrating Controllable Motion Skills from Demonstrations
Viaarxiv icon

StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory

Add code
Jul 25, 2024
Viaarxiv icon