Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models

32citations

arXiv:2412.05334

citations

#225

in CVPR 2025

of 2873 papers

Top Authors

Data Points

Top Authors

Zhejun Zhang Peter Karkus Maximilian Igl Wenhao Ding Yuxiao Chen Boris Ivanovic Marco Pavone

Abstract

Traffic simulation aims to learn a policy for traffic agents that, when unrolled in closed-loop, faithfully recovers the joint distribution of trajectories observed in the real world. Inspired by large language models, tokenized multi-agent policies have recently become the state-of-the-art in traffic simulation. However, they are typically trained through open-loop behavior cloning, and thus suffer from covariate shift when executed in closed-loop during simulation. In this work, we present Closest Among Top-K (CAT-K) rollouts, a simple yet effective closed-loop fine-tuning strategy to mitigate covariate shift. CAT-K fine-tuning only requires existing trajectory data, without reinforcement learning or generative adversarial imitation. Concretely, CAT-K fine-tuning enables a small 7M-parameter tokenized traffic simulation policy to outperform a 102M-parameter model from the same model family, achieving the top spot on the Waymo Sim Agent Challenge leaderboard at the time of submission. The code is available at https://github.com/NVlabs/catk.

Citation History

Jan 24, 2026

Feb 13, 2026

32+4

Feb 13, 2026