Tak-Shing T. Chan

Backpropagation with N-D Vector-Valued Neurons Using Arbitrary Bilinear Products

May 24, 2018

Zhe-Cheng Fan, Tak-Shing T. Chan, Yi-Hsuan Yang, Jyh-Shing R. Jang

Abstract: Vector-valued neural learning has emerged as a promising direction in deep learning recently. Traditionally, training data for neural networks (NNs) are formulated as a vector of scalars; however, its performance may not be optimal since associations among adjacent scalars are not modeled. In this paper, we propose a new vector neural architecture called the Arbitrary BIlinear Product Neural Network (ABIPNN), which processes information as vectors in each neuron, and the feedforward projections are defined using arbitrary bilinear products. Such bilinear products can include circular convolution, seven-dimensional vector product, skew circular convolution, reversed-time circular convolution, or other new products not seen in previous work. As a proof-of-concept, we apply our proposed network to multispectral image denoising and singing voice separation. Experimental results show that ABIPNN gains substantial improvements when compared to conventional NNs, suggesting that associations are learned during training.
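
To illustrate what a bilinear product between vector-valued quantities looks like, here is a minimal NumPy sketch of circular convolution, one of the products named in the abstract, written two ways: through the FFT and as a contraction with a third-order coefficient tensor. This is not the authors' implementation; the function names (`circular_convolution`, `bilinear_product`, `circular_convolution_tensor`) and the tensor layout `T[k, i, j]` are assumptions made for illustration only.

```python
import numpy as np

def circular_convolution(a, b):
    """Circular convolution of two equal-length real vectors via the FFT."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return np.real(np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)))

def bilinear_product(a, b, T):
    """Generic bilinear product: out[k] = sum over i, j of T[k, i, j] * a[i] * b[j]."""
    return np.einsum("kij,i,j->k", T, a, b)

def circular_convolution_tensor(n):
    """Coefficient tensor (hypothetical layout) that makes bilinear_product
    reproduce circular convolution: a[i] * b[j] contributes to output (i + j) mod n."""
    T = np.zeros((n, n, n))
    for i in range(n):
        for j in range(n):
            T[(i + j) % n, i, j] = 1.0
    return T

if __name__ == "__main__":
    a = np.array([1.0, 2.0, 3.0])
    b = np.array([0.0, 1.0, 0.5])
    T = circular_convolution_tensor(3)
    print(circular_convolution(a, b))
    print(bilinear_product(a, b, T))
```

Swapping in a different coefficient tensor `T` gives a different bilinear product while the rest of the code stays unchanged, which is the sense in which the product can be treated as arbitrary.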

* 14 pages, 8 figures, 3 tables

Informed Group-Sparse Representation for Singing Voice Separation

Jan 09, 2018

Tak-Shing T. Chan, Yi-Hsuan Yang

Abstract: Singing voice separation attempts to separate the vocal and instrumental parts of a music recording, which is a fundamental problem in music information retrieval. Recent work on singing voice separation has shown that the low-rank representation and informed separation approaches are both able to improve separation quality. However, low-rank optimizations are computationally inefficient due to the use of singular value decompositions. Therefore, in this paper, we propose a new linear-time algorithm called informed group-sparse representation, and use it to separate the vocals from music using pitch annotations as side information. Experimental results on the iKala dataset confirm the efficacy of our approach, suggesting that the music accompaniment follows a group-sparse structure given a pre-trained instrumental dictionary. We also show how our work can be easily extended to accommodate multiple dictionaries using the DSD100 dataset.

* IEEE Signal Process. Lett., vol. 24, no. 2, pp. 156-160, Feb. 2017
* 5 pages, 1 figure

Polar $n$-Complex and $n$-Bicomplex Singular Value Decomposition and Principal Component Pursuit

Jan 09, 2018

Tak-Shing T. Chan, Yi-Hsuan Yang

Abstract: Informed by recent work on tensor singular value decomposition and circulant algebra matrices, this paper presents a new theoretical bridge that unifies the hypercomplex and tensor-based approaches to singular value decomposition and robust principal component analysis. We begin our work by extending the principal component pursuit to Olariu's polar $n$-complex numbers as well as their bicomplex counterparts. In so doing, we have derived the polar $n$-complex and $n$-bicomplex proximity operators for both the $\ell_1$- and trace-norm regularizers, which can be used by proximal optimization methods such as the alternating direction method of multipliers. Experimental results on two sets of audio data show that our algebraically-informed formulation outperforms tensor robust principal component analysis. We conclude with the message that an informed definition of the trace norm can bridge the gap between the hypercomplex and tensor-based approaches. Our approach can be seen as a general methodology for generating other principal component pursuit algorithms with proper algebraic structures.

* IEEE Trans. Signal Process., vol. 64, no. 24, pp. 6533-6544, Dec. 2016
* 12 pages, 2 figures

Complex and Quaternionic Principal Component Pursuit and Its Application to Audio Separation

Jan 09, 2018

Tak-Shing T. Chan, Yi-Hsuan Yang

Abstract: Recently, the principal component pursuit has received increasing attention in signal processing research ranging from source separation to video surveillance. So far, all existing formulations are real-valued and lack the concept of phase, which is inherent in inputs such as complex spectrograms or color images. Thus, in this letter, we extend principal component pursuit to the complex and quaternionic cases to account for the missing phase information. Specifically, we present both complex and quaternionic proximity operators for the $\ell_1$- and trace-norm regularizers. These operators can be used in conjunction with proximal minimization methods such as the inexact augmented Lagrange multiplier algorithm. The new algorithms are then applied to the singing voice separation problem, which aims to separate the singing voice from the instrumental accompaniment. Results on the iKala and MSD100 datasets confirmed the usefulness of phase information in principal component pursuit.
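
As a rough illustration of the two kinds of proximity operator mentioned in the abstract, the sketch below shows the widely used complex-valued forms: entrywise soft-thresholding for the $\ell_1$ norm (the magnitude shrinks, the phase is kept) and singular value thresholding for the trace norm. It covers only the complex case and is assumed to match the standard textbook operators rather than the paper's exact derivation; the function names and the threshold parameter `tau` are hypothetical.

```python
import numpy as np

def soft_threshold_complex(Z, tau):
    """Entrywise l1 proximity operator: shrink each magnitude by tau, keep the phase."""
    Z = np.asarray(Z, dtype=complex)
    mag = np.abs(Z)
    # Guard against division by zero; entries with zero magnitude stay zero.
    scale = np.maximum(mag - tau, 0.0) / np.where(mag > 0, mag, 1.0)
    return scale * Z

def singular_value_threshold(Z, tau):
    """Trace-norm proximity operator: soft-threshold the singular values of Z."""
    U, s, Vh = np.linalg.svd(np.asarray(Z, dtype=complex), full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vh

if __name__ == "__main__":
    Z = np.array([[3 + 4j, 0.1j], [0.0, -2.0]])
    print(soft_threshold_complex(Z, 1.0))
    print(singular_value_threshold(Z, 1.0))
```

In a principal component pursuit solver, operators of this kind would typically be applied in alternation, the first to the sparse component and the second to the low-rank component.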

* IEEE Signal Process. Lett., vol. 23, no. 2, pp. 287-291, Feb. 2016
* 5 pages, 1 figure
