Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Apr 07, 2021

Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai

Figure 1 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Figure 2 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Figure 3 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Figure 4 for Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Share this with someone who'll enjoy it:

Abstract:Generative probability models are widely used for speaker verification (SV). However, the generative models are lack of discriminative feature selection ability. As a hypothesis test, the SV can be regarded as a binary classification task which can be designed as a Siamese neural network (SiamNN) with discriminative training. However, in most of the discriminative training for SiamNN, only the distribution of pair-wised sample distances is considered, and the additional discriminative information in joint distribution of samples is ignored. In this paper, we propose a novel SiamNN with consideration of the joint distribution of samples. The joint distribution of samples is first formulated based on a joint Bayesian (JB) based generative model, then a SiamNN is designed with dense layers to approximate the factorized affine transforms as used in the JB model. By initializing the SiamNN with the learned model parameters of the JB model, we further train the model parameters with the pair-wised samples as a binary discrimination task for SV. We carried out SV experiments on data corpus of speakers in the wild (SITW) and VoxCeleb. Experimental results showed that our proposed model improved the performance with a large margin compared with state of the art models for SV.

* arXiv admin note: substantial text overlap with arXiv:2101.03329

View paper on

Share this with someone who'll enjoy it:

Title:Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Paper and Code