Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mikhail G. Mozerov

3D Shapes Local Geometry Codes Learning with SDF

Aug 19, 2021

Shun Yao, Fei Yang, Yongmei Cheng, Mikhail G. Mozerov

Figure 1 for 3D Shapes Local Geometry Codes Learning with SDF

Figure 2 for 3D Shapes Local Geometry Codes Learning with SDF

Figure 3 for 3D Shapes Local Geometry Codes Learning with SDF

Figure 4 for 3D Shapes Local Geometry Codes Learning with SDF

Abstract:A signed distance function (SDF) as the 3D shape description is one of the most effective approaches to represent 3D geometry for rendering and reconstruction. Our work is inspired by the state-of-the-art method DeepSDF that learns and analyzes the 3D shape as the iso-surface of its shell and this method has shown promising results especially in the 3D shape reconstruction and compression domain. In this paper, we consider the degeneration problem of reconstruction coming from the capacity decrease of the DeepSDF model, which approximates the SDF with a neural network and a single latent code. We propose Local Geometry Code Learning (LGCL), a model that improves the original DeepSDF results by learning from a local shape geometry of the full 3D shape. We add an extra graph neural network to split the single transmittable latent code into a set of local latent codes distributed on the 3D shape. Mentioned latent codes are used to approximate the SDF in their local regions, which will alleviate the complexity of the approximation compared to the original DeepSDF. Furthermore, we introduce a new geometric loss function to facilitate the training of these local latent codes. Note that other local shape adjusting methods use the 3D voxel representation, which in turn is a problem highly difficult to solve or even is insolvable. In contrast, our architecture is based on graph processing implicitly and performs the learning regression process directly in the latent code space, thus make the proposed architecture more flexible and also simple for realization. Our experiments on 3D shape reconstruction demonstrate that our LGCL method can keep more details with a significantly smaller size of the SDF decoder and outperforms considerably the original DeepSDF method under the most important quantitative metrics.

* DLGC workshop in ICCV 2021

Via

Access Paper or Ask Questions

Slimmable Compressive Autoencoders for Practical Neural Image Compression

Mar 29, 2021

Fei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov

Figure 1 for Slimmable Compressive Autoencoders for Practical Neural Image Compression

Figure 2 for Slimmable Compressive Autoencoders for Practical Neural Image Compression

Figure 3 for Slimmable Compressive Autoencoders for Practical Neural Image Compression

Figure 4 for Slimmable Compressive Autoencoders for Practical Neural Image Compression

Abstract:Neural image compression leverages deep neural networks to outperform traditional image codecs in rate-distortion performance. However, the resulting models are also heavy, computationally demanding and generally optimized for a single rate, limiting their practical use. Focusing on practical image compression, we propose slimmable compressive autoencoders (SlimCAEs), where rate (R) and distortion (D) are jointly optimized for different capacities. Once trained, encoders and decoders can be executed at different capacities, leading to different rates and complexities. We show that a successful implementation of SlimCAEs requires suitable capacity-specific RD tradeoffs. Our experiments show that SlimCAEs are highly flexible models that provide excellent rate-distortion performance, variable rate, and dynamic adjustment of memory, computational cost and latency, thus addressing the main requirements of practical image compression.

* Accepted to CVPR 2021

Via

Access Paper or Ask Questions

Sparse data interpolation using the geodesic distance affinity space

May 06, 2019

Mikhail G. Mozerov, Fei Yang, Joost van de Weijer

Figure 1 for Sparse data interpolation using the geodesic distance affinity space

Figure 2 for Sparse data interpolation using the geodesic distance affinity space

Figure 3 for Sparse data interpolation using the geodesic distance affinity space

Figure 4 for Sparse data interpolation using the geodesic distance affinity space

Abstract:In this paper, we adapt the geodesic distance-based recursive filter to the sparse data interpolation problem. The proposed technique is general and can be easily applied to any kind of sparse data. We demonstrate the superiority over other interpolation techniques in three experiments for qualitative and quantitative evaluation. In addition, we compare our method with the popular interpolation algorithm presented in the EpicFlow optical flow paper that is intuitively motivated by a similar geodesic distance principle. The comparison shows that our algorithm is more accurate and considerably faster than the EpicFlow interpolation technique.

Via

Access Paper or Ask Questions

One-view occlusion detection for stereo matching with a fully connected CRF model

Jan 12, 2019

Mikhail G. Mozerov, Joost van de Weijer

Figure 1 for One-view occlusion detection for stereo matching with a fully connected CRF model

Figure 2 for One-view occlusion detection for stereo matching with a fully connected CRF model

Figure 3 for One-view occlusion detection for stereo matching with a fully connected CRF model

Figure 4 for One-view occlusion detection for stereo matching with a fully connected CRF model

Abstract:In this paper, we extend the standard belief propagation (BP) sequential technique proposed in the tree-reweighted sequential method to the fully connected CRF models with the geodesic distance affinity. The proposed method has been applied to the stereo matching problem. Also a new approach to the BP marginal solution is proposed that we call one-view occlusion detection (OVOD). In contrast to the standard winner takes all (WTA) estimation, the proposed OVOD solution allows to find occluded regions in the disparity map and simultaneously improve the matching result. As a result we can perform only one energy minimization process and avoid the cost calculation for the second view and the left-right check procedure. We show that the OVOD approach considerably improves results for cost augmentation and energy minimization techniques in comparison with the standard one-view affinity space implementation. We apply our method to the Middlebury data set and reach state-of-the-art especially for median, average and mean squared error metrics.

Via

Access Paper or Ask Questions

Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minization in Stereo and Motion

Oct 29, 2014

Mikhail G. Mozerov

Figure 1 for Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minization in Stereo and Motion

Figure 2 for Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minization in Stereo and Motion

Figure 3 for Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minization in Stereo and Motion

Figure 4 for Extended Dynamic Programming and Fast Multidimensional Search Algorithm for Energy Minization in Stereo and Motion

Abstract:This paper presents a novel extended dynamic programming approach for energy minimization (EDP) to solve the correspondence problem for stereo and motion. A significant speedup is achieved using a recursive minimum search strategy (RMS). The mentioned speedup is particularly important if the disparity space is 2D as well as 3D. The proposed RMS can also be applied in the well-known dynamic programming (DP) approach for stereo and motion. In this case, the general 2D problem of the global discrete energy minimization is reduced to several mutually independent sub-problems of the one-dimensional minimization. The EDP method is used when the approximation of the general 2D discrete energy minimization problem is considered. Then the RMS algorithm is an essential part of the EDP method. Using the EDP algorithm we obtain a lower energy bound than the graph cuts (GC) expansion technique on stereo and motion problems. The proposed calculation scheme possesses natural parallelism and can be realized on graphics processing unit (GPU) platforms, and can be potentially restricted further by the number of scanlines in the image plane. Furthermore, the RMS and EDP methods can be used in any optimization problem where the objective function meets specific conditions in the smoothness term.

* 12 pages, 11 figures

Via

Access Paper or Ask Questions