Title: DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Jul 22, 2024

Jiahang Tu, Wei Ji, Hanbin Zhao, Chao Zhang, Roger Zimmermann, Hui Qian

Abstract: In autonomous driving, deep models have shown remarkable performance across various visual perception tasks, but they demand high-quality and highly diverse training datasets. Such datasets are expected to cover various driving scenarios with adverse weather, varied lighting conditions, and diverse moving objects. However, manually collecting these data is challenging and expensive. Building on the rapid development of large generative models, we propose DriveDiTFit, a novel method for efficiently generating autonomous Driving data by Fine-tuning pre-trained Diffusion Transformers (DiTs). Specifically, DriveDiTFit utilizes a gap-driven modulation technique to carefully select and efficiently fine-tune a few parameters in DiTs according to the discrepancy between the pre-trained source data and the target driving data. Additionally, DriveDiTFit develops an effective weather and lighting condition embedding module, initialized by a nearest-semantic-similarity initialization approach, to ensure diversity in the generated data. Through a progressive tuning scheme that refines detail generation in the early diffusion process, and by enlarging the training-loss weights corresponding to small objects, DriveDiTFit ensures high-quality generation of small moving objects in the generated data. Extensive experiments conducted on driving datasets confirm that our method can efficiently produce diverse real driving data. The source code will be available at https://github.com/TtuHamg/DriveDiTFit.
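The abstract mentions enlarging the training-loss weights that correspond to small objects. As a rough illustration of that general idea only, the sketch below computes a weighted mean squared error in which positions flagged by a mask contribute more. The function name `weighted_noise_loss`, the flat-list inputs, the binary mask, and the default weight of 2.0 are all assumptions made for this example; the abstract does not specify the authors' actual loss, and this is not their implementation.

```python
def weighted_noise_loss(pred, target, small_object_mask, small_object_weight=2.0):
    """Weighted mean squared error over flat lists of equal length.

    Hypothetical sketch: positions where small_object_mask is truthy are
    weighted by small_object_weight, all other positions by 1.0. The result
    is normalised by the sum of the weights.
    """
    if not (len(pred) == len(target) == len(small_object_mask)):
        raise ValueError("inputs must have the same length")
    if not pred:
        raise ValueError("inputs must be non-empty")

    total = 0.0
    weight_sum = 0.0
    for p, t, m in zip(pred, target, small_object_mask):
        w = small_object_weight if m else 1.0  # assumed weighting rule
        total += w * (p - t) ** 2
        weight_sum += w
    return total / weight_sum


if __name__ == "__main__":
    # The error at the masked position counts three times as much here.
    print(weighted_noise_loss([1.0, 0.0], [0.0, 0.0], [1, 0], small_object_weight=3.0))
```

With an all-zero mask the function reduces to an ordinary mean squared error, which makes it easy to compare against an unweighted baseline.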
