Abstract: The advancement of socially aware autonomous vehicles hinges on precise modeling of human behavior. Within this broad paradigm, the specific challenge lies in accurately predicting pedestrians' trajectories and intentions. Traditional methodologies have leaned heavily on historical trajectory data, frequently overlooking vital contextual cues such as pedestrian-specific traits and environmental factors. Furthermore, there is a notable knowledge gap, as trajectory and intention prediction have largely been approached as separate problems despite their mutual dependence. To bridge this gap, we introduce PTINet (Pedestrian Trajectory and Intention Prediction Network), which jointly learns trajectory and intention prediction by combining past trajectory observations, local contextual features (individual pedestrian behaviors), and global features (signs, markings, etc.). The efficacy of our approach is evaluated on the widely used public datasets JAAD and PIE, where it demonstrates superior performance over existing state-of-the-art models in trajectory and intention prediction. The results of our experiments and ablation studies validate PTINet's effectiveness in jointly exploring intention and trajectory prediction for pedestrian behavior modeling. The experimental evaluation indicates the advantage of using global and local contextual features for pedestrian trajectory and intention prediction. The effectiveness of PTINet in predicting pedestrian behavior paves the way for the development of automated systems capable of seamlessly interacting with pedestrians in urban settings.
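A minimal sketch of the joint prediction idea described above: past trajectory, local (pedestrian-specific), and global (scene) features are fused into a shared representation that feeds both a trajectory head and an intention head. All layer sizes, the fusion scheme, and the class `JointTrajIntentNet` are illustrative assumptions, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class JointTrajIntentNet(nn.Module):
    def __init__(self, obs_len=15, pred_len=45, local_dim=32, global_dim=32, hidden=128):
        super().__init__()
        self.traj_encoder = nn.GRU(input_size=2, hidden_size=hidden, batch_first=True)
        self.local_fc = nn.Linear(local_dim, hidden)    # pedestrian-specific cues
        self.global_fc = nn.Linear(global_dim, hidden)  # scene cues (signs, markings)
        self.fuse = nn.Linear(3 * hidden, hidden)
        self.traj_head = nn.Linear(hidden, pred_len * 2)  # future (x, y) positions
        self.intent_head = nn.Linear(hidden, 1)           # crossing-intention logit
        self.pred_len = pred_len

    def forward(self, past_traj, local_feat, global_feat):
        _, h = self.traj_encoder(past_traj)  # h: (num_layers, B, hidden)
        z = torch.cat([h[-1],
                       torch.relu(self.local_fc(local_feat)),
                       torch.relu(self.global_fc(global_feat))], dim=-1)
        z = torch.relu(self.fuse(z))
        traj = self.traj_head(z).view(-1, self.pred_len, 2)
        intent = self.intent_head(z)  # train with BCEWithLogitsLoss
        return traj, intent

# Joint training would combine a regression loss on trajectories with a
# classification loss on intention, so both tasks shape the shared features.
model = JointTrajIntentNet()
traj, intent = model(torch.randn(4, 15, 2), torch.randn(4, 32), torch.randn(4, 32))
```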
Abstract: Mapping the surrounding environment is essential for the successful operation of autonomous robots. While extensive research has focused on mapping geometric structures and static objects, the environment is also shaped by the movement of dynamic objects. Incorporating information about spatial motion patterns can allow mobile robots to navigate and operate successfully in populated areas. In this paper, we propose a deep state-space model that learns map representations of spatial motion patterns and how they change over time at a given place. To evaluate our method, we use two datasets: a synthetic dataset with specified motion patterns and a dataset of real-world pedestrian data. We test the performance of our model by evaluating its learning ability, mapping quality, and applicability to downstream tasks. The results demonstrate that our model can effectively learn the corresponding motion patterns and has the potential to be applied to downstream robotic tasks.
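A minimal sketch of a deep state-space model over a grid map of motion patterns, in the spirit of the abstract above: a latent state per map cell is rolled forward by a learned transition and decoded into a per-cell distribution over movement directions. The GRU-based transition, the 8-direction emission, and the class `MotionPatternSSM` are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn

class MotionPatternSSM(nn.Module):
    def __init__(self, latent=16, directions=8):
        super().__init__()
        # Transition: evolves each cell's latent state one time step forward.
        self.transition = nn.GRUCell(input_size=latent, hidden_size=latent)
        # Emission: decodes a latent state into a direction distribution.
        self.emission = nn.Linear(latent, directions)

    def forward(self, z0, steps):
        """z0: (num_cells, latent) initial latent state for every grid cell."""
        z, maps = z0, []
        for _ in range(steps):
            z = self.transition(torch.zeros_like(z), z)       # autonomous dynamics
            maps.append(torch.softmax(self.emission(z), -1))  # per-cell directions
        return torch.stack(maps)  # (steps, num_cells, directions)

model = MotionPatternSSM()
pattern_maps = model(torch.randn(100, 16), steps=5)  # 10x10 map, 5 time steps
```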
Abstract: Mapping people dynamics is a crucial skill because it enables robots to coexist in human-inhabited environments. However, learning a model of people dynamics is a time-consuming process that requires observing large numbers of people moving through an environment. Moreover, existing approaches for mapping dynamics are unable to transfer the learned models across environments: each model can only describe the dynamics of the environment in which it was built. However, the effect of architectural geometry on people's movement can be used to estimate their dynamics, and recent work has looked into learning maps of dynamics from geometry. So far, however, these methods have been evaluated only on small-scale synthetic data, leaving their actual ability to generalize to real conditions unexplored. In this work, we propose a novel approach to learning people dynamics from geometry, in which a model is trained and evaluated on real human trajectories in large-scale environments. We then show the ability of our method to generalize to unseen environments, which is unprecedented for maps of dynamics.
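One simple way to picture "dynamics from geometry", sketched under assumptions not stated in the abstract: a small CNN maps a local occupancy patch to a distribution over motion directions at the patch centre, trained against real trajectory data and then queried in unseen environments. The patch size, architecture, and the name `geometry_to_dynamics` are all hypothetical.

```python
import torch
import torch.nn as nn

geometry_to_dynamics = nn.Sequential(
    nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(32, 8),  # logits over 8 motion directions at the patch centre
)

# Training pairs each occupancy patch with the directions of real human
# trajectories observed at its centre; at test time the same network is
# applied to patches from an environment never seen during training.
patch = torch.randn(1, 1, 32, 32)  # local occupancy crop
direction_probs = geometry_to_dynamics(patch).softmax(-1)
```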
Abstract: Identifying the environment's structure, i.e., detecting core components such as rooms and walls, can facilitate several tasks fundamental to the successful operation of indoor autonomous mobile robots, including semantic environment understanding. These robots often rely on 2D occupancy maps for core tasks such as localization and motion and task planning. However, reliable identification of structure and room segmentation from 2D occupancy maps is still an open problem due to clutter (e.g., furniture and movable objects), occlusions, and partial coverage. We propose a method for the RObust StructurE identification and ROom SEgmentation (ROSE^2) of 2D occupancy maps, which may be cluttered and incomplete. ROSE^2 identifies the main directions of walls and is resilient to clutter and partial observations, allowing the extraction of a clean, abstract, floor-plan-like geometrical description of the environment, which is then used to segment, i.e., to identify rooms in, the original occupancy grid map. ROSE^2 is tested on several publicly available real-world cluttered maps obtained under different conditions. The results show that it can robustly identify the environment's structure in 2D occupancy maps suffering from clutter and partial observations, while significantly improving room segmentation accuracy. Thanks to the combination of clutter removal and robust room segmentation, ROSE^2 consistently outperforms the state-of-the-art methods against which it is compared.
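A minimal sketch of the first step named above, finding the main wall directions of an occupancy map. The simple gradient-orientation histogram used here is an illustrative stand-in for ROSE^2's actual direction detection, and `dominant_wall_directions` is a hypothetical helper.

```python
import numpy as np

def dominant_wall_directions(occ_grid, n_dirs=2, bins=180):
    """Return the n_dirs strongest wall orientations (degrees, modulo 180)."""
    gy, gx = np.gradient(occ_grid.astype(float))
    mag = np.hypot(gx, gy)
    ang = np.mod(np.degrees(np.arctan2(gy, gx)), 180.0)  # undirected lines
    hist, edges = np.histogram(ang, bins=bins, range=(0, 180), weights=mag)
    top = np.argsort(hist)[-n_dirs:]
    return sorted((edges[i] + edges[i + 1]) / 2 for i in top)

occ = np.zeros((64, 64)); occ[10, 5:60] = 1; occ[10:50, 5] = 1  # two walls
print(dominant_wall_directions(occ))  # ~[0, 90] for axis-aligned walls
```

Pixels aligned with the detected directions can then be kept as structure, while off-direction occupied cells are treated as clutter candidates.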
Abstract: We propose a method for measuring how well each point in an indoor 2D robot map agrees with the underlying structure that governs the construction of the environment. This structure scoring has applications in, e.g., easier robot deployment and the cleaning of maps. In particular, we demonstrate its effectiveness for removing clutter and artifacts from real-world maps, which in turn is an enabler for other map-processing components, e.g., room segmentation. Starting from the Fourier transform, we detect peaks in the unfolded frequency spectrum that correspond to a set of dominant directions. This allows us to reconstruct a nominal reference map and score the input map through its correspondence with this reference, without requiring access to a ground-truth map.
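A minimal sketch of the scoring idea: keep only the dominant components of the map's 2D Fourier spectrum, reconstruct a nominal reference map from them, and score each cell by its agreement with that reconstruction. Selecting components by a relative magnitude threshold (rather than by dominant directions in the unfolded spectrum) is a simplifying assumption, as is the helper `structure_score`.

```python
import numpy as np

def structure_score(occ_grid, rel_thresh=0.1):
    spectrum = np.fft.fft2(occ_grid)
    mag = np.abs(spectrum)
    mask = mag >= rel_thresh * mag.max()  # keep dominant components only
    nominal = np.real(np.fft.ifft2(spectrum * mask))  # structure-only map
    # High score where the input agrees with the reconstructed structure.
    return 1.0 - np.abs(occ_grid - nominal) / (np.abs(nominal).max() + 1e-9)

occ = np.zeros((64, 64)); occ[::8, :] = 1  # regular "walls"
occ[30, 20] = 1                            # a clutter artifact
scores = structure_score(occ)
print(scores[0, 0] > scores[30, 20])       # wall cells score higher: True
```

No ground-truth map enters the computation: the reference is built from the input map's own spectrum.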
Abstract: The problem of gas detection is relevant to many real-world applications, such as leak detection in industrial settings and landfill monitoring. Using mobile robots for gas detection has several advantages and can reduce danger to humans. In our work, we address the problem of planning a path for a mobile robotic platform equipped with a remote gas sensor that minimizes the time needed to detect all gas sources in a given environment. We cast this problem as a coverage planning problem by defining a basic sensing operation -- a scan with the remote gas sensor -- as the field of "view" of the sensor. Given the computational effort required by previously proposed offline approaches, in this paper we propose an online coverage algorithm, called Next-Best-Smell, adapted from the Next-Best-View class of exploration algorithms. Our algorithm evaluates candidate locations with a global utility function that combines utility values for travel distance, information gain, and sensing time using Multi-Criteria Decision Making. In our experiments, conducted both in simulation and with a real robot, we found the performance of the Next-Best-Smell approach to be comparable to that of the state-of-the-art offline algorithm, at a much lower computational cost.
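A minimal sketch of the candidate selection just described: each candidate scan location is scored by a global utility over normalized criteria for travel distance, expected information gain, and sensing time. The weighted-sum aggregation is one simple Multi-Criteria Decision Making choice, and the weights and the helper `next_best_smell` are illustrative assumptions.

```python
def next_best_smell(candidates, weights=(0.4, 0.4, 0.2)):
    """candidates: list of dicts with 'distance', 'info_gain', 'sense_time'."""
    def norm(values, lower_is_better):
        # Min-max normalize each criterion to [0, 1], higher = better.
        lo, hi = min(values), max(values)
        span = (hi - lo) or 1.0
        return [(hi - v) / span if lower_is_better else (v - lo) / span
                for v in values]
    d = norm([c['distance'] for c in candidates], lower_is_better=True)
    g = norm([c['info_gain'] for c in candidates], lower_is_better=False)
    t = norm([c['sense_time'] for c in candidates], lower_is_better=True)
    w_d, w_g, w_t = weights
    utilities = [w_d * di + w_g * gi + w_t * ti for di, gi, ti in zip(d, g, t)]
    return max(range(len(candidates)), key=utilities.__getitem__)

best = next_best_smell([
    {'distance': 3.0, 'info_gain': 0.8, 'sense_time': 20.0},
    {'distance': 9.0, 'info_gain': 0.9, 'sense_time': 25.0},
])
print(best)  # index of the candidate with the highest global utility
```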