Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Svetlana Seliunina

Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Nov 17, 2024

Svetlana Seliunina, Artem Otelepko, Raphael Memmesheimer, Sven Behnke

Figure 1 for Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Figure 2 for Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Figure 3 for Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Figure 4 for Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Abstract:Robots need to perceive persons in their surroundings for safety and to interact with them. In this paper, we present a person segmentation and action classification approach that operates on 3D scans of hemisphere field of view LiDAR sensors. We recorded a data set with an Ouster OSDome-64 sensor consisting of scenes where persons perform three different actions and annotated it. We propose a method based on a MaskDINO model to detect and segment persons and to recognize their actions from combined spherical projected multi-channel representations of the LiDAR data with an additional positional encoding. Our approach demonstrates good performance for the person segmentation task and further performs well for the estimation of the person action states walking, waving, and sitting. An ablation study provides insights about the individual channel contributions for the person segmentation task. The trained models, code and dataset are made publicly available.

* 6 pages, 9 figures, 4 tables, accepted for publication at IEEE/SICE International Symposium on System Integration (SII), Munich, Germany, January 2025

Via

Access Paper or Ask Questions