Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nils Carlson

Does Phase Matter For Monaural Source Separation?

Nov 02, 2017

Mohit Dubey, Garrett Kenyon, Nils Carlson, Austin Thresher

Figure 1 for Does Phase Matter For Monaural Source Separation?

Figure 2 for Does Phase Matter For Monaural Source Separation?

Figure 3 for Does Phase Matter For Monaural Source Separation?

Abstract:The "cocktail party" problem of fully separating multiple sources from a single channel audio waveform remains unsolved. Current biological understanding of neural encoding suggests that phase information is preserved and utilized at every stage of the auditory pathway. However, current computational approaches primarily discard phase information in order to mask amplitude spectrograms of sound. In this paper, we seek to address whether preserving phase information in spectral representations of sound provides better results in monaural separation of vocals from a musical track by using a neurally plausible sparse generative model. Our results demonstrate that preserving phase information reduces artifacts in the separated tracks, as quantified by the signal to artifact ratio (GSAR). Furthermore, our proposed method achieves state-of-the-art performance for source separation, as quantified by a mean signal to interference ratio (GSIR) of 19.46.

* 4 pages, 2 figures, NIPS format

Via

Access Paper or Ask Questions

Phase Transitions in Image Denoising via Sparsely Coding Convolutional Neural Networks

Oct 26, 2017

Jacob Carroll, Nils Carlson, Garrett T. Kenyon

Figure 1 for Phase Transitions in Image Denoising via Sparsely Coding Convolutional Neural Networks

Figure 2 for Phase Transitions in Image Denoising via Sparsely Coding Convolutional Neural Networks

Figure 3 for Phase Transitions in Image Denoising via Sparsely Coding Convolutional Neural Networks

Abstract:Neural networks are analogous in many ways to spin glasses, systems which are known for their rich set of dynamics and equally complex phase diagrams. We apply well-known techniques in the study of spin glasses to a convolutional sparsely encoding neural network and observe power law finite-size scaling behavior in the sparsity and reconstruction error as the network denoises 32$\times$32 RGB CIFAR-10 images. This finite-size scaling indicates the presence of a continuous phase transition at a critical value of this sparsity. By using the power law scaling relations inherent to finite-size scaling, we can determine the optimal value of sparsity for any network size by tuning the system to the critical point and operate the system at the minimum denoising error.

* 4 pages, 3 figures, submitted to NIPS 2017 workshop: Advances in Modeling and Learning Interactions from Complex Data

Via

Access Paper or Ask Questions