Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Dec 05, 2024

Zhejun Zhang, Peter Karkus, Maximilian Igl, Wenhao Ding, Yuxiao Chen, Boris Ivanovic, Marco Pavone

Figure 1 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Figure 2 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Figure 3 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Figure 4 for Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Share this with someone who'll enjoy it:

Abstract:Traffic simulation aims to learn a policy for traffic agents that, when unrolled in closed-loop, faithfully recovers the joint distribution of trajectories observed in the real world. Inspired by large language models, tokenized multi-agent policies have recently become the state-of-the-art in traffic simulation. However, they are typically trained through open-loop behavior cloning, and thus suffer from covariate shift when executed in closed-loop during simulation. In this work, we present Closest Among Top-K (CAT-K) rollouts, a simple yet effective closed-loop fine-tuning strategy to mitigate covariate shift. CAT-K fine-tuning only requires existing trajectory data, without reinforcement learning or generative adversarial imitation. Concretely, CAT-K fine-tuning enables a small 7M-parameter tokenized traffic simulation policy to outperform a 102M-parameter model from the same model family, achieving the top spot on the Waymo Sim Agent Challenge leaderboard at the time of submission. The code is available at https://github.com/NVlabs/catk.

* Project Page: https://zhejz.github.io/catk/

View paper on

Share this with someone who'll enjoy it:

Title:Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

Paper and Code