Antonio Joia Neto

Learning multiplane images from single views with self-supervision

Oct 19, 2021

Gustavo Sutter P. Carvalho, Diogo C. Luvizon, Antonio Joia Neto, Andre G. C. Pacheco, Otavio A. B. Penatti

Abstract: Generating static novel views from an already captured image is a hard task in computer vision and graphics, in particular when the single input image has dynamic parts such as people or moving objects. In this paper, we tackle this problem by proposing a new framework, called CycleMPI, that is capable of learning a multiplane image representation from single images through a cyclic training strategy for self-supervision. Our framework does not require stereo data for training, so it can be trained with massive visual data from the Internet, resulting in a better generalization capability even for very challenging cases. Although our method does not require stereo data for supervision, it reaches results on stereo datasets comparable to the state of the art in a zero-shot scenario. We evaluated our method on the RealEstate10K and Mannequin Challenge datasets for view synthesis and presented qualitative results on the Places II dataset.

* To appear at BMVC 2021
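
For readers unfamiliar with the multiplane image (MPI) representation mentioned in the abstract, the sketch below shows how a stack of RGBA planes is usually flattened into one image by back-to-front "over" compositing. It is only an illustration of the representation, not the CycleMPI network or its cyclic training. The function name `composite_mpi`, the array shapes, and the back-to-front plane ordering are assumptions made for this example, and the per-plane warping needed to render a new viewpoint is omitted.

```python
import numpy as np


def composite_mpi(rgb, alpha):
    """Flatten a multiplane image into a single RGB image.

    rgb:   array of shape (D, H, W, 3), colors in [0, 1]
    alpha: array of shape (D, H, W, 1), opacities in [0, 1]
    Planes are assumed to be ordered from back (index 0) to front.
    """
    out = np.zeros(rgb.shape[1:], dtype=np.float64)
    for color, a in zip(rgb, alpha):
        # Standard "over" operator: the nearer plane covers what is behind it.
        out = color * a + out * (1.0 - a)
    return out


if __name__ == "__main__":
    # Hypothetical 1x1 image with two planes: opaque red behind, half-transparent blue in front.
    rgb = np.array([[[[1.0, 0.0, 0.0]]], [[[0.0, 0.0, 1.0]]]])
    alpha = np.array([[[[1.0]]], [[[0.5]]]])
    print(composite_mpi(rgb, alpha))
```

To render a different viewpoint, each plane would typically be warped into the target camera before compositing; that step depends on camera parameters the abstract does not give.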

Improving Deep Learning Sound Events Classifiers using Gram Matrix Feature-wise Correlations

Feb 23, 2021

Antonio Joia Neto, Andre G C Pacheco, Diogo C Luvizon

Abstract: In this paper, we propose a new Sound Event Classification (SEC) method inspired by recent work on out-of-distribution detection. In our method, we analyse all the activations of a generic CNN in order to produce feature representations using Gram Matrices. The similarity metrics are evaluated considering all possible classes, and the final prediction is defined as the class that minimizes the deviation with respect to the features seen during training. The proposed approach can be applied to any CNN, and our experimental evaluation of four different architectures on two datasets demonstrated that our method consistently improves the baseline models.

* To appear at ICASSP 2021
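
The decision rule described in the abstract (build Gram-matrix features from CNN activations, then pick the class whose training features the sample deviates from least) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the per-class min/max ranges, the relative deviation formula, and all function names (`gram_features`, `fit_class_ranges`, `deviation`, `predict`) are hypothetical choices made for this example, and activations are passed in as plain arrays rather than extracted from a real CNN.

```python
import numpy as np


def gram_features(activation):
    """Map one layer activation of shape (channels, positions) to the
    flattened upper triangle of its Gram matrix (channel correlations)."""
    gram = activation @ activation.T
    return gram[np.triu_indices(gram.shape[0])]


def fit_class_ranges(activations_by_class):
    """Record, per class and per layer, the min and max of each Gram feature.

    activations_by_class: dict mapping label -> list of samples, where each
    sample is a list of per-layer activation arrays.
    """
    ranges = {}
    for label, samples in activations_by_class.items():
        per_layer = []
        for layer in range(len(samples[0])):
            feats = np.stack([gram_features(s[layer]) for s in samples])
            per_layer.append((feats.min(axis=0), feats.max(axis=0)))
        ranges[label] = per_layer
    return ranges


def deviation(sample_layers, class_ranges, eps=1e-8):
    """Sum of relative distances by which the sample's Gram features fall
    outside the ranges recorded for one class (assumed deviation measure)."""
    total = 0.0
    for act, (lo, hi) in zip(sample_layers, class_ranges):
        f = gram_features(act)
        below = np.maximum(lo - f, 0.0) / (np.abs(lo) + eps)
        above = np.maximum(f - hi, 0.0) / (np.abs(hi) + eps)
        total += float(below.sum() + above.sum())
    return total


def predict(sample_layers, ranges):
    """Return the class with the smallest total deviation."""
    return min(ranges, key=lambda label: deviation(sample_layers, ranges[label]))


if __name__ == "__main__":
    # Toy single-layer "activations" standing in for real CNN outputs.
    train = {
        "a": [[np.full((2, 3), 1.0)], [np.full((2, 3), 1.1)]],
        "b": [[np.full((2, 3), 5.0)], [np.full((2, 3), 5.2)]],
    }
    ranges = fit_class_ranges(train)
    print(predict([np.full((2, 3), 1.05)], ranges))
    print(predict([np.full((2, 3), 5.1)], ranges))
```

Because the rule only reads stored activations, a sketch like this can sit on top of any trained CNN without retraining it, which matches the abstract's claim that the approach applies to any CNN.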
