Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lihui Peng

Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving

Dec 13, 2024

Zhihang Song, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

Figure 1 for Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving

Figure 2 for Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving

Figure 3 for Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving

Figure 4 for Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving

Abstract:The multi-modal perception methods are thriving in the autonomous driving field due to their better usage of complementary data from different sensors. Such methods depend on calibration and synchronization between sensors to get accurate environmental information. There have already been studies about space-alignment robustness in autonomous driving object detection process, however, the research for time-alignment is relatively few. As in reality experiments, LiDAR point clouds are more challenging for real-time data transfer, our study used historical frames of LiDAR to better align features when the LiDAR data lags exist. We designed a Timealign module to predict and combine LiDAR features with observation to tackle such time misalignment based on SOTA GraphBEV framework.

* 8 pages, 3 figures

Via

Access Paper or Ask Questions

Advancing Auto-Regressive Continuation for Video Frames

Dec 04, 2024

Ruibo Ming, Jingwei Wu, Zhewei Huang, Zhuoxuan Ju, Jianming HU, Lihui Peng, Shuchang Zhou

Figure 1 for Advancing Auto-Regressive Continuation for Video Frames

Figure 2 for Advancing Auto-Regressive Continuation for Video Frames

Figure 3 for Advancing Auto-Regressive Continuation for Video Frames

Figure 4 for Advancing Auto-Regressive Continuation for Video Frames

Abstract:Recent advances in auto-regressive large language models (LLMs) have shown their potential in generating high-quality text, inspiring researchers to apply them to image and video generation. This paper explores the application of LLMs to video continuation, a task essential for building world models and predicting future frames. In this paper, we tackle challenges including preventing degeneration in long-term frame generation and enhancing the quality of generated images. We design a scheme named ARCON, which involves training our model to alternately generate semantic tokens and RGB tokens, enabling the LLM to explicitly learn and predict the high-level structural information of the video. We find high consistency in the RGB images and semantic maps generated without special design. Moreover, we employ an optical flow-based texture stitching method to enhance the visual quality of the generated videos. Quantitative and qualitative experiments in autonomous driving scenarios demonstrate our model can consistently generate long videos.

* Under Review

Via

Access Paper or Ask Questions

A re-calibration method for object detection with multi-modal alignment bias in autonomous driving

May 27, 2024

Zhihang Song, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

Figure 1 for A re-calibration method for object detection with multi-modal alignment bias in autonomous driving

Figure 2 for A re-calibration method for object detection with multi-modal alignment bias in autonomous driving

Figure 3 for A re-calibration method for object detection with multi-modal alignment bias in autonomous driving

Figure 4 for A re-calibration method for object detection with multi-modal alignment bias in autonomous driving

Abstract:Multi-modal object detection in autonomous driving has achieved great breakthroughs due to the usage of fusing complementary information from different sensors. The calibration in fusion between sensors such as LiDAR and camera is always supposed to be precise in previous work. However, in reality, calibration matrices are fixed when the vehicles leave the factory, but vibration, bumps, and data lags may cause calibration bias. As the research on the calibration influence on fusion detection performance is relatively few, flexible calibration dependency multi-sensor detection method has always been attractive. In this paper, we conducted experiments on SOTA detection method EPNet++ and proved slight bias on calibration can reduce the performance seriously. We also proposed a re-calibration model based on semantic segmentation which can be combined with a detection algorithm to improve the performance and robustness of multi-modal calibration bias.

* 10 pages, 6 figures

Via

Access Paper or Ask Questions

A Survey on Video Prediction: From Deterministic to Generative Approaches

Jan 31, 2024

Ruibo Ming, Zhewei Huang, Zhuoxuan Ju, Jianming Hu, Lihui Peng, Shuchang Zhou

Figure 1 for A Survey on Video Prediction: From Deterministic to Generative Approaches

Figure 2 for A Survey on Video Prediction: From Deterministic to Generative Approaches

Abstract:Video prediction, a fundamental task in computer vision, aims to enable models to generate sequences of future frames based on existing video content. This task has garnered widespread application across various domains. In this paper, we comprehensively survey both historical and contemporary works in this field, encompassing the most widely used datasets and algorithms. Our survey scrutinizes the challenges and evolving landscape of video prediction within the realm of computer vision. We propose a novel taxonomy centered on the stochastic nature of video prediction algorithms. This taxonomy accentuates the gradual transition from deterministic to generative prediction methodologies, underlining significant advancements and shifts in approach.

* under review

Via

Access Paper or Ask Questions

Synthetic Datasets for Autonomous Driving: A Survey

Apr 24, 2023

Zhihang Song, Zimin He, Xingyu Li, Qiming Ma, Ruibo Ming, Zhiqi Mao, Huaxin Pei, Lihui Peng, Jianming Hu, Danya Yao(+1 more)

Figure 1 for Synthetic Datasets for Autonomous Driving: A Survey

Figure 2 for Synthetic Datasets for Autonomous Driving: A Survey

Figure 3 for Synthetic Datasets for Autonomous Driving: A Survey

Figure 4 for Synthetic Datasets for Autonomous Driving: A Survey

Abstract:Autonomous driving techniques have been flourishing in recent years while thirsting for huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs. Therefore, more and more researchers are turning to synthetic datasets to easily generate rich and changeable data as an effective complement to the real world and to improve the performance of algorithms. In this paper, we summarize the evolution of synthetic dataset generation methods and review the work to date in synthetic datasets related to single and multi-task categories for to autonomous driving study. We also discuss the role that synthetic dataset plays the evaluation, gap test, and positive effect in autonomous driving related algorithm testing, especially on trustworthiness and safety aspects. Finally, we discuss general trends and possible development directions. To the best of our knowledge, this is the first survey focusing on the application of synthetic datasets in autonomous driving. This survey also raises awareness of the problems of real-world deployment of autonomous driving technology and provides researchers with a possible solution.

* 19 pages, 5 figures

Via

Access Paper or Ask Questions