Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Stefano Sabatini

Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization

Jul 03, 2025

Caio Azevedo, Lina Achaji, Stefano Sabatini, Nicola Poerio, Grzegorz Bartyzel, Sascha Hornauer, Fabien Moutarde

Abstract:Trajectory prediction is an essential step in the pipeline of an autonomous vehicle. Inaccurate or inconsistent predictions regarding the movement of agents in its surroundings lead to poorly planned maneuvers and potentially dangerous situations for the end-user. Current state-of-the-art deep-learning-based trajectory prediction models can achieve excellent accuracy on public datasets. However, when used in more complex, interactive scenarios, they often fail to capture important interdependencies between agents, leading to inconsistent predictions among agents in the traffic scene. Inspired by the efficacy of incorporating human preference into large language models, this work fine-tunes trajectory prediction models in multi-agent settings using preference optimization. By taking as input automatically calculated preference rankings among predicted futures in the fine-tuning process, our experiments--using state-of-the-art models on three separate datasets--show that we are able to significantly improve scene consistency while minimally sacrificing trajectory prediction accuracy and without adding any excess computational requirements at inference time.

* Accepted for publication at ITSC 2025

Via

Access Paper or Ask Questions

Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Aug 01, 2024

Yixiao Wang, Chen Tang, Lingfeng Sun, Simone Rossi, Yichen Xie, Chensheng Peng, Thomas Hannagan, Stefano Sabatini, Nicola Poerio, Masayoshi Tomizuka(+1 more)

Figure 1 for Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Figure 2 for Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Figure 3 for Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Figure 4 for Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation

Abstract:Diffusion models are promising for joint trajectory prediction and controllable generation in autonomous driving, but they face challenges of inefficient inference steps and high computational demands. To tackle these challenges, we introduce Optimal Gaussian Diffusion (OGD) and Estimated Clean Manifold (ECM) Guidance. OGD optimizes the prior distribution for a small diffusion time $T$ and starts the reverse diffusion process from it. ECM directly injects guidance gradients to the estimated clean manifold, eliminating extensive gradient backpropagation throughout the network. Our methodology streamlines the generative process, enabling practical applications with reduced computational overhead. Experimental validation on the large-scale Argoverse 2 dataset demonstrates our approach's superior performance, offering a viable solution for computationally efficient, high-quality joint trajectory prediction and controllable generation for autonomous driving. Our project webpage is at https://yixiaowang7.github.io/OptTrajDiff_Page/.

* 30 pages, 20 figures, Accepted to ECCV 2024

Via

Access Paper or Ask Questions

Exploiting map information for self-supervised learning in motion forecasting

Oct 10, 2022

Caio Azevedo, Thomas Gilles, Stefano Sabatini, Dzmitry Tsishkou

Figure 1 for Exploiting map information for self-supervised learning in motion forecasting

Figure 2 for Exploiting map information for self-supervised learning in motion forecasting

Figure 3 for Exploiting map information for self-supervised learning in motion forecasting

Figure 4 for Exploiting map information for self-supervised learning in motion forecasting

Abstract:Inspired by recent developments regarding the application of self-supervised learning (SSL), we devise an auxiliary task for trajectory prediction that takes advantage of map-only information such as graph connectivity with the intent of improving map comprehension and generalization. We apply this auxiliary task through two frameworks - multitasking and pretraining. In either framework we observe significant improvement of our baseline in metrics such as $\mathrm{minFDE}_6$ (as much as 20.3%) and $\mathrm{MissRate}_6$ (as much as 33.3%), as well as a richer comprehension of map features demonstrated by different training configurations. The results obtained were consistent in all three data sets used for experiments: Argoverse, Interaction and NuScenes. We also submit our new pretrained model's results to the Interaction challenge and achieve $\textit{1st}$ place with respect to $\mathrm{minFDE}_6$ and $\mathrm{minADE}_6$.

Via

Access Paper or Ask Questions

Exploring the trade off between human driving imitation and safety for traffic simulation

Aug 09, 2022

Yann Koeberle, Stefano Sabatini, Dzmitry Tsishkou, Christophe Sabourin

Figure 1 for Exploring the trade off between human driving imitation and safety for traffic simulation

Figure 2 for Exploring the trade off between human driving imitation and safety for traffic simulation

Figure 3 for Exploring the trade off between human driving imitation and safety for traffic simulation

Figure 4 for Exploring the trade off between human driving imitation and safety for traffic simulation

Abstract:Traffic simulation has gained a lot of interest for quantitative evaluation of self driving vehicles performance. In order for a simulator to be a valuable test bench, it is required that the driving policy animating each traffic agent in the scene acts as humans would do while maintaining minimal safety guarantees. Learning the driving policies of traffic agents from recorded human driving data or through reinforcement learning seems to be an attractive solution for the generation of realistic and highly interactive traffic situations in uncontrolled intersections or roundabouts. In this work, we show that a trade-off exists between imitating human driving and maintaining safety when learning driving policies. We do this by comparing how various Imitation learning and Reinforcement learning algorithms perform when applied to the driving task. We also propose a multi objective learning algorithm (MOPPO) that improves both objectives together. We test our driving policies on highly interactive driving scenarios extracted from INTERACTION Dataset to evaluate how human-like they behave.

Via

Access Paper or Ask Questions

Uncertainty estimation for Cross-dataset performance in Trajectory prediction

May 15, 2022

Thomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde

Figure 1 for Uncertainty estimation for Cross-dataset performance in Trajectory prediction

Figure 2 for Uncertainty estimation for Cross-dataset performance in Trajectory prediction

Figure 3 for Uncertainty estimation for Cross-dataset performance in Trajectory prediction

Figure 4 for Uncertainty estimation for Cross-dataset performance in Trajectory prediction

Abstract:While a lot of work has been done on developing trajectory prediction methods, and various datasets have been proposed for benchmarking this task, little study has been done so far on the generalizability and the transferability of these methods across dataset. In this paper, we study the performance of a state-of-the-art trajectory prediction method across four different datasets (Argoverse, NuScenes, Interaction, Shifts). We first check how a similar method can be applied and trained on all these datasets with similar hyperparameters. Then we highlight which datasets work best on others, and study how uncertainty estimation allows for a better transferable performance; proposing a novel way to estimate uncertainty and to directly use it in prediction.

* Workshop on Fresh Perspectives on the Future of Autonomous Driving, ICRA 2022

Via

Access Paper or Ask Questions

THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling

Oct 17, 2021

Thomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde

Figure 1 for THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling

Figure 2 for THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling

Figure 3 for THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling

Figure 4 for THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling

Abstract:In this paper, we propose THOMAS, a joint multi-agent trajectory prediction framework allowing for efficient and consistent prediction of multi-agent multi-modal trajectories. We present a unified model architecture for fast and simultaneous agent future heatmap estimation leveraging hierarchical and sparse image generation. We demonstrate that heatmap output enables a higher level of control on the predicted trajectories compared to vanilla multi-modal trajectory regression, allowing to incorporate additional constraints for tighter sampling or collision-free predictions in a deterministic way. However, we also highlight that generating scene-consistent predictions goes beyond the mere generation of collision-free trajectories. We therefore propose a learnable trajectory recombination model that takes as input a set of predicted trajectories for each agent and outputs its consistent reordered recombination. We report our results on the Interaction multi-agent prediction challenge and rank $1^{st}$ on the online test leaderboard.

Via

Access Paper or Ask Questions

GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation

Sep 21, 2021

Thomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde

Figure 1 for GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation

Figure 2 for GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation

Figure 3 for GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation

Figure 4 for GOHOME: Graph-Oriented Heatmap Output for future Motion Estimation

Abstract:In this paper, we propose GOHOME, a method leveraging graph representations of the High Definition Map and sparse projections to generate a heatmap output representing the future position probability distribution for a given agent in a traffic scene. This heatmap output yields an unconstrained 2D grid representation of agent future possible locations, allowing inherent multimodality and a measure of the uncertainty of the prediction. Our graph-oriented model avoids the high computation burden of representing the surrounding context as squared images and processing it with classical CNNs, but focuses instead only on the most probable lanes where the agent could end up in the immediate future. GOHOME reaches 2$nd$ on Argoverse Motion Forecasting Benchmark on the MissRate$_6$ metric while achieving significant speed-up and memory burden diminution compared to Argoverse 1$^{st}$ place method HOME. We also highlight that heatmap output enables multimodal ensembling and improve 1$^{st}$ place MissRate$_6$ by more than 15$\%$ with our best ensemble on Argoverse. Finally, we evaluate and reach state-of-the-art performance on the other trajectory prediction datasets nuScenes and Interaction, demonstrating the generalizability of our method.

Via

Access Paper or Ask Questions

HOME: Heatmap Output for future Motion Estimation

Jun 02, 2021

Thomas Gilles, Stefano Sabatini, Dzmitry Tsishkou, Bogdan Stanciulescu, Fabien Moutarde

Figure 1 for HOME: Heatmap Output for future Motion Estimation

Figure 2 for HOME: Heatmap Output for future Motion Estimation

Figure 3 for HOME: Heatmap Output for future Motion Estimation

Figure 4 for HOME: Heatmap Output for future Motion Estimation

Abstract:In this paper, we propose HOME, a framework tackling the motion forecasting problem with an image output representing the probability distribution of the agent's future location. This method allows for a simple architecture with classic convolution networks coupled with attention mechanism for agent interactions, and outputs an unconstrained 2D top-view representation of the agent's possible future. Based on this output, we design two methods to sample a finite set of agent's future locations. These methods allow us to control the optimization trade-off between miss rate and final displacement error for multiple modalities without having to retrain any part of the model. We apply our method to the Argoverse Motion Forecasting Benchmark and achieve 1st place on the online leaderboard.

Via

Access Paper or Ask Questions