Johannes Petzold

Efficient Semantic Segmentation for Visual Bird's-eye View Interpretation

Nov 29, 2018

Timo Sämann, Karl Amende, Stefan Milz, Christian Witt, Martin Simon, Johannes Petzold

Abstract: The ability to perform semantic segmentation in real-time applications on limited hardware is of great importance. One such application is the interpretation of the visual bird's-eye view, which requires the semantic segmentation of the four omnidirectional camera images. In this paper, we present an efficient semantic segmentation that sets new standards in terms of runtime and hardware requirements. Our two main contributions are reducing the runtime by parallelizing the ArgMax layer and reducing the hardware requirements by applying the channel pruning method to the ENet model.

* Advances in Intelligent Systems and Computing 2018
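
The abstract above names parallelizing the ArgMax layer as one contribution but does not describe how it is done. As a rough illustration only, the sketch below shows why the operation parallelizes well: each pixel's label depends only on that pixel's class scores, so rows can be handed to independent workers. The function names, the nested-list score layout, and the thread pool are all assumptions made for this example and are not taken from the paper, whose implementation presumably targets GPU or embedded hardware.

```python
from concurrent.futures import ThreadPoolExecutor


def argmax_pixel(scores):
    """Return the index of the largest class score for one pixel.

    Ties resolve to the lowest class index.
    """
    best = 0
    for c in range(1, len(scores)):
        if scores[c] > scores[best]:
            best = c
    return best


def argmax_row(row):
    """Label every pixel in one image row."""
    return [argmax_pixel(px) for px in row]


def parallel_argmax(score_map, workers=4):
    """score_map[y][x] is a list of per-class scores (hypothetical layout).

    Rows are independent, so they are distributed over a worker pool.
    pool.map preserves row order in the returned label map.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(argmax_row, score_map))


# Toy 2x2 image with 3 classes.
scores = [
    [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2]],
    [[0.2, 0.2, 0.6], [0.9, 0.05, 0.05]],
]
labels = parallel_argmax(scores)
print(labels)  # [[1, 0], [2, 0]]
```

In CPython, threads will not speed up this pure-Python loop; the pool is used here only to make the independence of the per-row work explicit.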

Monocular Fisheye Camera Depth Estimation Using Sparse LiDAR Supervision

Sep 24, 2018

Varun Ravi Kumar, Stefan Milz, Martin Simon, Christian Witt, Karl Amende, Johannes Petzold, Senthil Yogamani, Timo Pech

Abstract: Near-field depth estimation around a self-driving car is an important function that can be achieved by four wide-angle fisheye cameras having a field of view of over 180°. Depth estimation based on convolutional neural networks (CNNs) produces state-of-the-art results, but progress is hindered because depth annotation cannot be obtained manually. Synthetic datasets are commonly used, but they have limitations. For instance, they do not capture the extensive variability in the appearance of objects like vehicles present in real datasets. There is also a domain shift when performing inference on natural images, as illustrated by the many attempts to handle domain adaptation explicitly. In this work, we explore an alternative approach of training using sparse LiDAR data as ground truth for depth estimation for a fisheye camera. We built our own dataset using our self-driving car setup, which has a 64-beam Velodyne LiDAR and four wide-angle fisheye cameras. To handle the difference in viewpoints of the LiDAR and the fisheye camera, an occlusion resolution mechanism was implemented. We started with Eigen's multiscale convolutional network architecture and improved it by modifying the activation function and optimizer. We obtained promising results on our dataset, with RMSE values comparable to the state-of-the-art results obtained on KITTI.
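
Sparse LiDAR ground truth covers only a fraction of the image pixels, so an error metric such as the RMSE mentioned above is typically computed over the pixels that actually have a LiDAR return. The following is a minimal sketch of that idea, assuming that a depth of 0.0 marks a pixel without ground truth; that convention, the function name, and the toy values are assumptions for illustration and are not taken from the paper.

```python
import math


def masked_rmse(pred, sparse_gt, invalid=0.0):
    """RMSE between predicted and ground-truth depth over valid pixels only.

    pred and sparse_gt are 2D lists of depths in the same unit.
    Pixels where sparse_gt equals `invalid` (assumed marker for
    "no LiDAR return") are skipped.
    """
    sq_sum = 0.0
    count = 0
    for p_row, g_row in zip(pred, sparse_gt):
        for p, g in zip(p_row, g_row):
            if g != invalid:
                sq_sum += (p - g) ** 2
                count += 1
    if count == 0:
        raise ValueError("ground truth contains no valid depth values")
    return math.sqrt(sq_sum / count)


# Toy 2x2 depth maps: only two pixels have LiDAR ground truth.
pred = [[10.0, 12.0], [8.0, 5.0]]
gt = [[0.0, 10.0], [0.0, 8.0]]

# Valid errors are 2.0 and -3.0, so the RMSE is sqrt((4 + 9) / 2).
print(masked_rmse(pred, gt))
```

The same mask can be applied to a training loss, so that pixels without a LiDAR return contribute no gradient.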
