Riccardo Pieroni

LCF3D: A Robust and Real-Time Late-Cascade Fusion Framework for 3D Object Detection in Autonomous Driving

Jan 14, 2026

Carlo Sgaravatti, Riccardo Pieroni, Matteo Corno, Sergio M. Savaresi, Luca Magri, Giacomo Boracchi

Abstract: Accurately localizing 3D objects like pedestrians, cyclists, and other vehicles is essential in Autonomous Driving. To ensure high detection performance, Autonomous Vehicles complement RGB cameras with LiDAR sensors, but effectively combining these data sources for 3D object detection remains challenging. We propose LCF3D, a novel sensor fusion framework that combines a 2D object detector on RGB images with a 3D object detector on LiDAR point clouds. By leveraging multimodal fusion principles, we compensate for inaccuracies in the LiDAR object detection network. Our solution combines two key principles: (i) late fusion, to reduce LiDAR False Positives by matching LiDAR 3D detections with RGB 2D detections and filtering out unmatched LiDAR detections; and (ii) cascade fusion, to recover missed objects from LiDAR by generating new 3D frustum proposals corresponding to unmatched RGB detections. Experiments show that LCF3D is beneficial for domain generalization, as it turns out to be successful in handling different sensor configurations between training and testing domains. LCF3D achieves significant improvements over LiDAR-based methods, particularly for challenging categories like pedestrians and cyclists in the KITTI dataset, as well as motorcycles and bicycles in nuScenes. Code can be downloaded from: https://github.com/CarloSgaravatti/LCF3D.

* 35 pages, 14 figures. Published at Pattern Recognition
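
The two principles named in the abstract can be illustrated with a minimal sketch. It assumes the LiDAR 3D detections have already been projected to 2D image boxes, and it uses greedy IoU matching with a threshold of 0.5; the matching rule, the threshold, and the function names `iou` and `split_detections` are illustrative assumptions, not the authors' implementation (see their repository for that).

```python
from typing import List, Tuple

# (x_min, y_min, x_max, y_max) in pixels
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned 2D boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def split_detections(
    lidar_boxes: List[Box], rgb_boxes: List[Box], iou_thr: float = 0.5
) -> Tuple[List[int], List[int]]:
    """Greedy one-to-one matching (an assumed, simple strategy).

    Returns:
      kept_lidar:    indices of LiDAR detections confirmed by an RGB detection
                     (late fusion: unmatched LiDAR detections are dropped).
      unmatched_rgb: indices of RGB detections with no LiDAR match
                     (cascade fusion: candidates for new frustum proposals).
    """
    pairs = sorted(
        ((iou(l, r), i, j)
         for i, l in enumerate(lidar_boxes)
         for j, r in enumerate(rgb_boxes)),
        reverse=True,
    )
    used_lidar, used_rgb = set(), set()
    for score, i, j in pairs:
        if score < iou_thr:
            break  # remaining pairs overlap even less
        if i in used_lidar or j in used_rgb:
            continue
        used_lidar.add(i)
        used_rgb.add(j)
    kept_lidar = sorted(used_lidar)
    unmatched_rgb = [j for j in range(len(rgb_boxes)) if j not in used_rgb]
    return kept_lidar, unmatched_rgb


if __name__ == "__main__":
    lidar = [(0, 0, 10, 10), (100, 100, 120, 120)]  # second one has no RGB support
    rgb = [(1, 1, 11, 11), (50, 50, 60, 60)]        # second one was missed by LiDAR
    print(split_detections(lidar, rgb))
```

In this toy input, the second LiDAR box would be filtered out as a likely false positive, while the second RGB box would be passed on as a candidate for a frustum proposal.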

A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection

Apr 25, 2025

Carlo Sgaravatti, Roberto Basla, Riccardo Pieroni, Matteo Corno, Sergio M. Savaresi, Luca Magri, Giacomo Boracchi

Abstract: We present a new way to detect 3D objects from multimodal inputs, leveraging both LiDAR and RGB cameras in a hybrid late-cascade scheme that combines an RGB detection network and a 3D LiDAR detector. We exploit late fusion principles to reduce LiDAR False Positives, matching LiDAR detections with RGB ones by projecting the LiDAR bounding boxes onto the image. We rely on cascade fusion principles to recover LiDAR False Negatives, leveraging epipolar constraints and frustums generated by RGB detections of separate views. Our solution can be plugged on top of any underlying single-modal detectors, enabling a flexible training process that can take advantage of pre-trained LiDAR and RGB detectors, or train the two branches separately. We evaluate our results on the KITTI object detection benchmark, showing significant performance improvements, especially for the detection of Pedestrians and Cyclists.
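
As a rough illustration of the projection step mentioned in the abstract, the sketch below maps the eight corners of a 3D box through a 3x4 pinhole projection matrix and takes their 2D bounding rectangle. It makes several simplifying assumptions that are not from the paper: the box is already expressed in camera coordinates (a real pipeline would first apply the LiDAR-to-camera extrinsics), the box is axis-aligned (yaw is ignored), and the names `box_corners` and `project_box` are hypothetical.

```python
from itertools import product
from typing import List, Optional, Sequence, Tuple

Point3 = Tuple[float, float, float]
Box2D = Tuple[float, float, float, float]  # (u_min, v_min, u_max, v_max)


def box_corners(center: Point3, size: Point3) -> List[Point3]:
    """Eight corners of an axis-aligned 3D box (yaw ignored for brevity)."""
    cx, cy, cz = center
    dx, dy, dz = size
    return [
        (cx + sx * dx / 2, cy + sy * dy / 2, cz + sz * dz / 2)
        for sx, sy, sz in product((-1, 1), repeat=3)
    ]


def project_box(
    corners: Sequence[Point3], P: Sequence[Sequence[float]]
) -> Optional[Box2D]:
    """Project 3D corners with a 3x4 matrix P and return the enclosing 2D box.

    Corners with non-positive depth are skipped; returns None if no corner
    lies in front of the camera.
    """
    us, vs = [], []
    for x, y, z in corners:
        # Homogeneous projection: [u*w, v*w, w] = P @ [x, y, z, 1]
        uw, vw, w = (row[0] * x + row[1] * y + row[2] * z + row[3] for row in P)
        if w <= 0:
            continue  # behind the camera
        us.append(uw / w)
        vs.append(vw / w)
    if not us:
        return None
    return (min(us), min(vs), max(us), max(vs))


if __name__ == "__main__":
    # Toy intrinsics: focal length 100 px, principal point (50, 50).
    P = [[100.0, 0.0, 50.0, 0.0],
         [0.0, 100.0, 50.0, 0.0],
         [0.0, 0.0, 1.0, 0.0]]
    corners = box_corners(center=(0.0, 0.0, 10.0), size=(2.0, 2.0, 2.0))
    print(project_box(corners, P))
```

The resulting 2D rectangle can then be compared with the RGB detections, for example through an overlap measure such as IoU.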

Multi-Object Tracking with Camera-LiDAR Fusion for Autonomous Driving

Mar 06, 2024

Riccardo Pieroni, Simone Specchia, Matteo Corno, Sergio Matteo Savaresi

Abstract: This paper presents a novel multi-modal Multi-Object Tracking (MOT) algorithm for self-driving cars that combines camera and LiDAR data. Camera frames are processed with a state-of-the-art 3D object detector, whereas classical clustering techniques are used to process LiDAR observations. The proposed MOT algorithm comprises a three-step association process, an Extended Kalman filter for estimating the motion of each detected dynamic obstacle, and a track management phase. The EKF motion model requires the current measured relative position and orientation of the observed object and the longitudinal and angular velocities of the ego vehicle as inputs. Unlike most state-of-the-art multi-modal MOT approaches, the proposed algorithm does not rely on maps or knowledge of the ego global pose. Moreover, it uses a 3D detector exclusively for cameras and is agnostic to the type of LiDAR sensor used. The algorithm is validated both in simulation and with real-world data, with satisfactory results.

* Published at IEEE European Control Conference 2024
