Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ali Osman Ulusoy

Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Apr 22, 2019

Despoina Paschalidou, Ali Osman Ulusoy, Andreas Geiger

Figure 1 for Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Figure 2 for Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Figure 3 for Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Figure 4 for Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids

Abstract:Abstracting complex 3D shapes with parsimonious part-based representations has been a long standing goal in computer vision. This paper presents a learning-based solution to this problem which goes beyond the traditional 3D cuboid representation by exploiting superquadrics as atomic elements. We demonstrate that superquadrics lead to more expressive 3D scene parses while being easier to learn than 3D cuboid representations. Moreover, we provide an analytical solution to the Chamfer loss which avoids the need for computational expensive reinforcement learning or iterative prediction. Our model learns to parse 3D objects into consistent superquadric representations without supervision. Results on various ShapeNet categories as well as the SURREAL human body dataset demonstrate the flexibility of our model in capturing fine details and complex poses that could not have been modelled using cuboids.

* CVPR 2019 Camera Ready. Project Page:https://github.com/paschalidoud/superquadric_parsing

Via

Access Paper or Ask Questions

RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials

Jan 06, 2019

Despoina Paschalidou, Ali Osman Ulusoy, Carolin Schmitt, Luc van Gool, Andreas Geiger

Figure 1 for RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials

Figure 2 for RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials

Figure 3 for RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials

Figure 4 for RayNet: Learning Volumetric 3D Reconstruction with Ray Potentials

Abstract:In this paper, we consider the problem of reconstructing a dense 3D model using images captured from different views. Recent methods based on convolutional neural networks (CNN) allow learning the entire task from data. However, they do not incorporate the physics of image formation such as perspective geometry and occlusion. Instead, classical approaches based on Markov Random Fields (MRF) with ray-potentials explicitly model these physical processes, but they cannot cope with large surface appearance variations across different viewpoints. In this paper, we propose RayNet, which combines the strengths of both frameworks. RayNet integrates a CNN that learns view-invariant feature representations with an MRF that explicitly encodes the physics of perspective projection and occlusion. We train RayNet end-to-end using empirical risk minimization. We thoroughly evaluate our approach on challenging real-world datasets and demonstrate its benefits over a piece-wise trained baseline, hand-crafted models as well as other learning-based approaches.

* Accepted to CVPR 2018 as spotlight. Project url with code: http://raynet-mvs.com/

Via

Access Paper or Ask Questions

OctNetFusion: Learning Depth Fusion from Data

Oct 31, 2017

Gernot Riegler, Ali Osman Ulusoy, Horst Bischof, Andreas Geiger

Figure 1 for OctNetFusion: Learning Depth Fusion from Data

Figure 2 for OctNetFusion: Learning Depth Fusion from Data

Figure 3 for OctNetFusion: Learning Depth Fusion from Data

Figure 4 for OctNetFusion: Learning Depth Fusion from Data

Abstract:In this paper, we present a learning based approach to depth fusion, i.e., dense 3D reconstruction from multiple depth images. The most common approach to depth fusion is based on averaging truncated signed distance functions, which was originally proposed by Curless and Levoy in 1996. While this method is simple and provides great results, it is not able to reconstruct (partially) occluded surfaces and requires a large number frames to filter out sensor noise and outliers. Motivated by the availability of large 3D model repositories and recent advances in deep learning, we present a novel 3D CNN architecture that learns to predict an implicit surface representation from the input depth maps. Our learning based method significantly outperforms the traditional volumetric fusion approach in terms of noise reduction and outlier suppression. By learning the structure of real world 3D objects and scenes, our approach is further able to reconstruct occluded regions and to fill in gaps in the reconstruction. We demonstrate that our learning based approach outperforms both vanilla TSDF fusion as well as TV-L1 fusion on the task of volumetric fusion. Further, we demonstrate state-of-the-art 3D shape completion results.

* 3DV 2017, https://github.com/griegler/octnetfusion

Via

Access Paper or Ask Questions

OctNet: Learning Deep 3D Representations at High Resolutions

Apr 10, 2017

Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger

Figure 1 for OctNet: Learning Deep 3D Representations at High Resolutions

Figure 2 for OctNet: Learning Deep 3D Representations at High Resolutions

Figure 3 for OctNet: Learning Deep 3D Representations at High Resolutions

Figure 4 for OctNet: Learning Deep 3D Representations at High Resolutions

Abstract:We present OctNet, a representation for deep learning with sparse 3D data. In contrast to existing models, our representation enables 3D convolutional networks which are both deep and high resolution. Towards this goal, we exploit the sparsity in the input data to hierarchically partition the space using a set of unbalanced octrees where each leaf node stores a pooled feature representation. This allows to focus memory allocation and computation to the relevant dense regions and enables deeper networks without compromising resolution. We demonstrate the utility of our OctNet representation by analyzing the impact of resolution on several 3D tasks including 3D object classification, orientation estimation and point cloud labeling.

* CVPR 2017 camera ready

Via

Access Paper or Ask Questions