Abstract: In this work we introduce a biologically inspired long-range skip connection for the UNet architecture that relies on the perceptual illusion of hybrid images, i.e., images that simultaneously encode two images. The fusion of early encoder features with deeper decoder ones allows UNet models to produce finer-grained dense predictions. While proven in segmentation tasks, these benefits are diminished in dense regression tasks, as long-range skip connections additionally introduce texture transfer artifacts. For depth estimation in particular, this hurts smoothness and introduces false positive edges, which are detrimental to the task given the piece-wise smooth nature of depth maps. The proposed HybridSkip connections show improved performance in balancing the trade-off between edge preservation and the minimization of the texture transfer artifacts that hurt smoothness. This is achieved through the proper and balanced exchange of information that HybridSkip connections offer between the high-frequency encoder features and the low-frequency decoder features.
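Hybrid images combine the low-frequency content of one image with the high-frequency content of another. A minimal PyTorch sketch of a skip connection in that spirit follows; the module name `HybridSkipFusion`, the Gaussian low-pass filter, and the additive fusion rule are illustrative assumptions rather than the paper's exact formulation:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def gaussian_kernel(size: int = 5, sigma: float = 1.0) -> torch.Tensor:
    """Build a normalized 2D Gaussian kernel for low-pass filtering."""
    coords = torch.arange(size, dtype=torch.float32) - (size - 1) / 2
    g = torch.exp(-coords ** 2 / (2 * sigma ** 2))
    g = g / g.sum()
    return torch.outer(g, g)  # shape: (size, size)

class HybridSkipFusion(nn.Module):
    """Hybrid-image-style fusion of encoder and decoder features.
    Illustrative assumption: additive fusion of the encoder's
    high-pass residual with the decoder's low-pass content."""

    def __init__(self, channels: int, size: int = 5, sigma: float = 1.0):
        super().__init__()
        kernel = gaussian_kernel(size, sigma).expand(channels, 1, size, size)
        self.register_buffer("kernel", kernel.contiguous())
        self.pad = size // 2
        self.channels = channels

    def low_pass(self, x: torch.Tensor) -> torch.Tensor:
        # Depthwise Gaussian blur: keeps only low-frequency content.
        return F.conv2d(x, self.kernel, padding=self.pad, groups=self.channels)

    def forward(self, enc: torch.Tensor, dec: torch.Tensor) -> torch.Tensor:
        # The encoder's high-pass residual carries edges and texture;
        # the decoder's low-pass component carries smooth structure.
        # Both feature maps must share the same (N, C, H, W) shape.
        enc_high = enc - self.low_pass(enc)
        dec_low = self.low_pass(dec)
        return enc_high + dec_low
```

A usage sketch: `fuse = HybridSkipFusion(channels=64); out = fuse(enc_feat, dec_feat)`, with both feature maps at the same spatial resolution, i.e., after the decoder's upsampling step.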
Abstract: In this work we contribute a distribution shift benchmark for a computer vision task: monocular depth estimation. Our differentiation is the decomposition of the wider distribution shift of uncontrolled testing on in-the-wild data into three distinct distribution shifts. Specifically, we generate data via synthesis and analyze them to produce covariate (color input), prior (depth output) and concept (their relationship) distribution shifts. We also synthesize combinations of these shifts and show that each one is indeed a different challenge to address, as stacking them produces increased performance drops that cannot be addressed across the board using standard approaches.
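In standard distribution-shift notation, with $x$ denoting the color input and $d$ the depth output (a notational assumption consistent with the parentheticals above), the three shifts can be written as:

```latex
\begin{align*}
\text{covariate shift:} \quad & p_{\text{test}}(x) \neq p_{\text{train}}(x)
  && \text{(color input changes)} \\
\text{prior shift:} \quad & p_{\text{test}}(d) \neq p_{\text{train}}(d)
  && \text{(depth output changes)} \\
\text{concept shift:} \quad & p_{\text{test}}(d \mid x) \neq p_{\text{train}}(d \mid x)
  && \text{(their relationship changes)}
\end{align*}
```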
Abstract: We introduce HUMAN4D, a large and multimodal 4D dataset that contains a variety of human activities simultaneously captured by a professional marker-based MoCap, a volumetric capture, and an audio recording system. By capturing two female and two male professional actors performing various full-body movements and expressions, HUMAN4D provides a diverse set of motions and poses encountered as part of single- and multi-person daily, physical, and social activities (jumping, dancing, etc.), along with multi-RGBD (mRGBD), volumetric, and audio data. Despite the existence of multi-view color datasets captured with hardware (HW) synchronization, to the best of our knowledge, HUMAN4D is the first and only public resource that provides volumetric depth maps with high synchronization precision due to the use of intra- and inter-sensor HW-SYNC. Moreover, a spatio-temporally aligned, scanned, and rigged 3D character complements HUMAN4D to enable joint research on time-varying and high-quality dynamic meshes. We provide evaluation baselines by benchmarking HUMAN4D with state-of-the-art human pose estimation and 3D compression methods. For the former, we apply 2D and 3D pose estimation algorithms on both single- and multi-view data cues. For the latter, we benchmark open-source 3D codecs on volumetric data with respect to online volumetric video encoding and steady bit-rates. Furthermore, a qualitative and quantitative visual comparison between mesh-based volumetric data reconstructed at different quality levels showcases the available options with respect to 4D representations. HUMAN4D is introduced to the computer vision and graphics research communities to enable joint research on spatio-temporally aligned pose, volumetric, mRGBD, and audio data cues. The dataset and its code are available at https://tofis.github.io/myurls/human4d.
Abstract: Pano3D is a new benchmark for depth estimation from spherical panoramas. It aims to assess performance across all depth estimation traits: the primary trait of direct depth estimation performance, targeting precision and accuracy, as well as the secondary traits of boundary preservation and smoothness. Moreover, Pano3D moves beyond typical intra-dataset evaluation to inter-dataset performance assessment. By disentangling the capacity to generalize to unseen data into different test splits, Pano3D represents a holistic benchmark for $360^\circ$ depth estimation. We use it as the basis for an extended analysis seeking to offer insights into classical choices for depth estimation. This results in a solid baseline for panoramic depth estimation that follow-up works can build upon to steer future progress.
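For context, direct depth estimation performance is typically quantified with error and accuracy metrics such as RMSE, absolute relative error, and $\delta$ threshold accuracies. The sketch below computes these standard metrics; the function name is hypothetical and Pano3D's exact metric suite may differ:

```python
import numpy as np

def direct_depth_metrics(pred: np.ndarray, gt: np.ndarray, eps: float = 1e-6) -> dict:
    """Standard direct depth metrics: RMSE, absolute relative error,
    and delta threshold accuracies. Illustrative sketch; the
    benchmark's exact metric suite may differ."""
    # Ignore invalid (zero or negative) depth values in both maps.
    mask = (gt > eps) & (pred > eps)
    pred, gt = pred[mask], gt[mask]
    rmse = float(np.sqrt(np.mean((pred - gt) ** 2)))
    abs_rel = float(np.mean(np.abs(pred - gt) / gt))
    # delta accuracy: fraction of pixels whose prediction/ground-truth
    # ratio falls below 1.25, 1.25^2, and 1.25^3, respectively.
    ratio = np.maximum(pred / gt, gt / pred)
    deltas = {f"delta^{i}": float(np.mean(ratio < 1.25 ** i)) for i in (1, 2, 3)}
    return {"RMSE": rmse, "AbsRel": abs_rel, **deltas}
```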