Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

David Paz

SMART: Advancing Scalable Map Priors for Driving Topology Reasoning

Feb 06, 2025

Junjie Ye, David Paz, Hengyuan Zhang, Yuliang Guo, Xinyu Huang, Henrik I. Christensen, Yue Wang, Liu Ren

Abstract:Topology reasoning is crucial for autonomous driving as it enables comprehensive understanding of connectivity and relationships between lanes and traffic elements. While recent approaches have shown success in perceiving driving topology using vehicle-mounted sensors, their scalability is hindered by the reliance on training data captured by consistent sensor configurations. We identify that the key factor in scalable lane perception and topology reasoning is the elimination of this sensor-dependent feature. To address this, we propose SMART, a scalable solution that leverages easily available standard-definition (SD) and satellite maps to learn a map prior model, supervised by large-scale geo-referenced high-definition (HD) maps independent of sensor settings. Attributed to scaled training, SMART alone achieves superior offline lane topology understanding using only SD and satellite inputs. Extensive experiments further demonstrate that SMART can be seamlessly integrated into any online topology reasoning methods, yielding significant improvements of up to 28% on the OpenLane-V2 benchmark.

* Accepted by ICRA 2025. Project page: https://jay-ye.github.io/smart

Via

Access Paper or Ask Questions

MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis

Jan 11, 2025

Hengyuan Zhang, David Paz, Yuliang Guo, Xinyu Huang, Henrik I. Christensen, Liu Ren

Abstract:Online mapping reduces the reliance of autonomous vehicles on high-definition (HD) maps, significantly enhancing scalability. However, recent advancements often overlook cross-sensor configuration generalization, leading to performance degradation when models are deployed on vehicles with different camera intrinsics and extrinsics. With the rapid evolution of novel view synthesis methods, we investigate the extent to which these techniques can be leveraged to address the sensor configuration generalization challenge. We propose a novel framework leveraging Gaussian splatting to reconstruct scenes and render camera images in target sensor configurations. The target config sensor data, along with labels mapped to the target config, are used to train online mapping models. Our proposed framework on the nuScenes and Argoverse 2 datasets demonstrates a performance improvement of 18% through effective dataset augmentation, achieves faster convergence and efficient training, and exceeds state-of-the-art performance when using only 25% of the original training data. This enables data reuse and reduces the need for laborious data labeling. Project page at https://henryzhangzhy.github.io/mapgs.

Via

Access Paper or Ask Questions

Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps

Aug 01, 2024

Hengyuan Zhang, David Paz, Yuliang Guo, Arun Das, Xinyu Huang, Karsten Haug, Henrik I. Christensen, Liu Ren

Figure 1 for Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps

Figure 2 for Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps

Figure 3 for Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps

Figure 4 for Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps

Abstract:Autonomous driving for urban and highway driving applications often requires High Definition (HD) maps to generate a navigation plan. Nevertheless, various challenges arise when generating and maintaining HD maps at scale. While recent online mapping methods have started to emerge, their performance especially for longer ranges is limited by heavy occlusion in dynamic environments. With these considerations in mind, our work focuses on leveraging lightweight and scalable priors-Standard Definition (SD) maps-in the development of online vectorized HD map representations. We first examine the integration of prototypical rasterized SD map representations into various online mapping architectures. Furthermore, to identify lightweight strategies, we extend the OpenLane-V2 dataset with OpenStreetMaps and evaluate the benefits of graphical SD map representations. A key finding from designing SD map integration components is that SD map encoders are model agnostic and can be quickly adapted to new architectures that utilize bird's eye view (BEV) encoders. Our results show that making use of SD maps as priors for the online mapping task can significantly speed up convergence and boost the performance of the online centerline perception task by 30% (mAP). Furthermore, we show that the introduction of the SD maps leads to a reduction of the number of parameters in the perception and reasoning task by leveraging SD map graphs while improving the overall performance. Project Page: https://henryzhangzhy.github.io/sdhdmap/.

* Accepted by the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

Via

Access Paper or Ask Questions

OSM vs HD Maps: Map Representations for Trajectory Prediction

Nov 04, 2023

Jing-Yan Liao, Parth Doshi, Zihan Zhang, David Paz, Henrik Christensen

Abstract:While High Definition (HD) Maps have long been favored for their precise depictions of static road elements, their accessibility constraints and susceptibility to rapid environmental changes impede the widespread deployment of autonomous driving, especially in the motion forecasting task. In this context, we propose to leverage OpenStreetMap (OSM) as a promising alternative to HD Maps for long-term motion forecasting. The contributions of this work are threefold: firstly, we extend the application of OSM to long-horizon forecasting, doubling the forecasting horizon compared to previous studies. Secondly, through an expanded receptive field and the integration of intersection priors, our OSM-based approach exhibits competitive performance, narrowing the gap with HD Map-based models. Lastly, we conduct an exhaustive context-aware analysis, providing deeper insights in motion forecasting across diverse scenarios as well as conducting class-aware comparisons. This research not only advances long-term motion forecasting with coarse map representations but additionally offers a potential scalable solution within the domain of autonomous driving.

Via

Access Paper or Ask Questions

Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation

Nov 03, 2023

David Paz, Narayanan E. Ranganatha, Srinidhi K. Srinivas, Yunchao Yao, Henrik I. Christensen

Figure 1 for Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation

Figure 2 for Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation

Figure 3 for Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation

Figure 4 for Occlusion-Aware 2D and 3D Centerline Detection for Urban Driving via Automatic Label Generation

Abstract:This research work seeks to explore and identify strategies that can determine road topology information in 2D and 3D under highly dynamic urban driving scenarios. To facilitate this exploration, we introduce a substantial dataset comprising nearly one million automatically labeled data frames. A key contribution of our research lies in developing an automatic label-generation process and an occlusion handling strategy. This strategy is designed to model a wide range of occlusion scenarios, from mild disruptions to severe blockages. Furthermore, we present a comprehensive ablation study wherein multiple centerline detection methods are developed and evaluated. This analysis not only benchmarks the performance of various approaches but also provides valuable insights into the interpretability of these methods. Finally, we demonstrate the practicality of our methods and assess their adaptability across different sensor configurations, highlighting their versatility and relevance in real-world scenarios. Our dataset and experimental models are publicly available.

* 7 pages, 8 figures, 1 algorithm, 11 equations

Via

Access Paper or Ask Questions

CLiNet: Joint Detection of Road Network Centerlines in 2D and 3D

Feb 04, 2023

David Paz, Srinidhi Kalgundi Srinivas, Yunchao Yao, Henrik I. Christensen

Abstract:This work introduces a new approach for joint detection of centerlines based on image data by localizing the features jointly in 2D and 3D. In contrast to existing work that focuses on detection of visual cues, we explore feature extraction methods that are directly amenable to the urban driving task. To develop and evaluate our approach, a large urban driving dataset dubbed AV Breadcrumbs is automatically labeled by leveraging vector map representations and projective geometry to annotate over 900,000 images. Our results demonstrate potential for dynamic scene modeling across various urban driving scenarios. Our model achieves an F1 score of 0.684 and an average normalized depth error of 2.083. The code and data annotations are publicly available.

* 5 pages, 4 figures, 1 table. Under review at IEEE Intelligent Vehicles Symposium 2023

Via

Access Paper or Ask Questions

Robust Human Identity Anonymization using Pose Estimation

Jan 10, 2023

Hengyuan Zhang, Jing-Yan Liao, David Paz, Henrik I. Christensen

Abstract:Many outdoor autonomous mobile platforms require more human identity anonymized data to power their data-driven algorithms. The human identity anonymization should be robust so that less manual intervention is needed, which remains a challenge for current face detection and anonymization systems. In this paper, we propose to use the skeleton generated from the state-of-the-art human pose estimation model to help localize human heads. We develop criteria to evaluate the performance and compare it with the face detection approach. We demonstrate that the proposed algorithm can reduce missed faces and thus better protect the identity information for the pedestrians. We also develop a confidence-based fusion method to further improve the performance.

* 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE), Mexico City, Mexico, 2022, pp. 619-626
* Source code will be available at https://github.com/AutonomousVehicleLaboratory/anonymization

Via

Access Paper or Ask Questions

TridentNetV2: Lightweight Graphical Global Plan Representations for Dynamic Trajectory Generation

Mar 26, 2022

David Paz, Hao Xiang, Andrew Liang, Henrik I. Christensen

Figure 1 for TridentNetV2: Lightweight Graphical Global Plan Representations for Dynamic Trajectory Generation

Figure 2 for TridentNetV2: Lightweight Graphical Global Plan Representations for Dynamic Trajectory Generation

Figure 3 for TridentNetV2: Lightweight Graphical Global Plan Representations for Dynamic Trajectory Generation

Figure 4 for TridentNetV2: Lightweight Graphical Global Plan Representations for Dynamic Trajectory Generation

Abstract:We present a framework for dynamic trajectory generation for autonomous navigation, which does not rely on HD maps as the underlying representation. High Definition (HD) maps have become a key component in most autonomous driving frameworks, which include complete road network information annotated at a centimeter-level that include traversable waypoints, lane information, and traffic signals. Instead, the presented approach models the distributions of feasible ego-centric trajectories in real-time given a nominal graph-based global plan and a lightweight scene representation. By embedding contextual information, such as crosswalks, stop signs, and traffic signals, our approach achieves low errors across multiple urban navigation datasets that include diverse intersection maneuvers, while maintaining real-time performance and reducing network complexity. Underlying datasets introduced are available online.

* 7 pages, Accepted at ICRA 2022

Via

Access Paper or Ask Questions

TridentNet: A Conditional Generative Model for Dynamic Trajectory Generation

Jan 16, 2021

David Paz, Hengyuan Zhang, Henrik I. Christensen

Figure 1 for TridentNet: A Conditional Generative Model for Dynamic Trajectory Generation

Figure 2 for TridentNet: A Conditional Generative Model for Dynamic Trajectory Generation

Figure 3 for TridentNet: A Conditional Generative Model for Dynamic Trajectory Generation

Figure 4 for TridentNet: A Conditional Generative Model for Dynamic Trajectory Generation

Abstract:In recent years, various state of the art autonomous vehicle systems and architectures have been introduced. These methods include planners that depend on high-definition (HD) maps and models that learn an autonomous agent's controls in an end-to-end fashion. While end-to-end models are geared towards solving the scalability constraints from HD maps, they do not generalize for different vehicles and sensor configurations. To address these shortcomings, we introduce an approach that leverages lightweight map representations, explicitly enforcing geometric constraints, and learns feasible trajectories using a conditional generative model. Additional contributions include a new dataset that is used to verify our proposed models quantitatively. The results indicate low relative errors that can potentially translate to traversable trajectories. The dataset created as part of this work has been made available online.

* 13 pages, 6 figures, submitted to IAS-16

Via

Access Paper or Ask Questions

Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications

Oct 14, 2020

Yunhai Han, Yuhan Liu, David Paz, Henrik Christensen

Figure 1 for Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications

Figure 2 for Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications

Figure 3 for Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications

Figure 4 for Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications

Abstract:For use of cameras on an intelligent vehicle, driving over a major bump could challenge the calibration. It is then of interest to do dynamic calibration. What structures can be used for calibration? How about using traffic signs that you recognize? In this paper an approach is presented for dynamic camera calibration based on recognition of stop signs. The detection is performed based on convolutional neural networks (CNNs). A recognized sign is modeled as a polygon and matched to a model. Parameters are tracked over time. Experimental results show clear convergence and improved performance for the calibration.

* 8 pages, 15 figures, In review of RA-L/ICRA

Via

Access Paper or Ask Questions