Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haoxuan Dou

InfinityDrive: Breaking Time Limits in Driving World Models

Dec 02, 2024

Xi Guo, Chenjing Ding, Haoxuan Dou, Xin Zhang, Weixuan Tang, Wei Wu

Figure 1 for InfinityDrive: Breaking Time Limits in Driving World Models

Figure 2 for InfinityDrive: Breaking Time Limits in Driving World Models

Figure 3 for InfinityDrive: Breaking Time Limits in Driving World Models

Figure 4 for InfinityDrive: Breaking Time Limits in Driving World Models

Abstract:Autonomous driving systems struggle with complex scenarios due to limited access to diverse, extensive, and out-of-distribution driving data which are critical for safe navigation. World models offer a promising solution to this challenge; however, current driving world models are constrained by short time windows and limited scenario diversity. To bridge this gap, we introduce InfinityDrive, the first driving world model with exceptional generalization capabilities, delivering state-of-the-art performance in high fidelity, consistency, and diversity with minute-scale video generation. InfinityDrive introduces an efficient spatio-temporal co-modeling module paired with an extended temporal training strategy, enabling high-resolution (576$\times$1024) video generation with consistent spatial and temporal coherence. By incorporating memory injection and retention mechanisms alongside an adaptive memory curve loss to minimize cumulative errors, achieving consistent video generation lasting over 1500 frames (approximately 2 minutes). Comprehensive experiments in multiple datasets validate InfinityDrive's ability to generate complex and varied scenarios, highlighting its potential as a next-generation driving world model built for the evolving demands of autonomous driving. Our project homepage: https://metadrivescape.github.io/papers_project/InfinityDrive/page.html

* project homepage: https://metadrivescape.github.io/papers_project/InfinityDrive/page.html

Via

Access Paper or Ask Questions

AutoSampling: Search for Effective Data Sampling Schedules

May 28, 2021

Ming Sun, Haoxuan Dou, Baopu Li, Lei Cui, Junjie Yan, Wanli Ouyang

Figure 1 for AutoSampling: Search for Effective Data Sampling Schedules

Figure 2 for AutoSampling: Search for Effective Data Sampling Schedules

Figure 3 for AutoSampling: Search for Effective Data Sampling Schedules

Figure 4 for AutoSampling: Search for Effective Data Sampling Schedules

Abstract:Data sampling acts as a pivotal role in training deep learning models. However, an effective sampling schedule is difficult to learn due to the inherently high dimension of parameters in learning the sampling schedule. In this paper, we propose an AutoSampling method to automatically learn sampling schedules for model training, which consists of the multi-exploitation step aiming for optimal local sampling schedules and the exploration step for the ideal sampling distribution. More specifically, we achieve sampling schedule search with shortened exploitation cycle to provide enough supervision. In addition, we periodically estimate the sampling distribution from the learned sampling schedules and perturb it to search in the distribution space. The combination of two searches allows us to learn a robust sampling schedule. We apply our AutoSampling method to a variety of image classification tasks illustrating the effectiveness of the proposed method.

* ICML 2021
* Automl for sampling firstly without any assumpation

Via

Access Paper or Ask Questions

Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight

May 19, 2021

Ming Sun, Haoxuan Dou, Junjie Yan

Figure 1 for Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight

Figure 2 for Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight

Figure 3 for Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight

Figure 4 for Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight

Abstract:Transfer learning can boost the performance on the targettask by leveraging the knowledge of the source domain. Recent worksin neural architecture search (NAS), especially one-shot NAS, can aidtransfer learning by establishing sufficient network search space. How-ever, existing NAS methods tend to approximate huge search spaces byexplicitly building giant super-networks with multiple sub-paths, anddiscard super-network weights after a child structure is found. Both thecharacteristics of existing approaches causes repetitive network trainingon source tasks in transfer learning. To remedy the above issues, we re-duce the super-network size by randomly dropping connection betweennetwork blocks while embedding a larger search space. Moreover, wereuse super-network weights to avoid redundant training by proposinga novel framework consisting of two modules, the neural architecturesearch module for architecture transfer and the neural weight searchmodule for weight transfer. These two modules conduct search on thetarget task based on a reduced super-networks, so we only need to trainonce on the source task. We experiment our framework on both MS-COCO and CUB-200 for the object detection and fine-grained imageclassification tasks, and show promising improvements with onlyO(CN)super-network complexity.

* ECCV 2020
* NAS is one part of transfer learning

Via

Access Paper or Ask Questions