Title: Neural Scoring, Not Embedding: A Novel Framework for Robust Speaker Verification

Oct 21, 2024

Wan Lin, Junhui Chen, Tianhao Wang, Zhenyu Zhou, Lantian Li, Dong Wang

Abstract: Current mainstream speaker verification systems are predominantly based on the concept of "speaker embedding", which transforms variable-length speech signals into fixed-length speaker vectors; verification then relies on the cosine similarity between the embeddings of the enrollment and test utterances. However, this approach degrades considerably in the presence of severe noise and interfering speakers. This paper introduces Neural Scoring, a novel framework that reframes speaker verification as a scoring task using a Transformer-based architecture. The proposed method first extracts an embedding from the enrollment speech and frame-level features from the test speech. A Transformer network then generates a decision score that quantifies the likelihood of the enrolled speaker being present in the test speech. We evaluated Neural Scoring on the VoxCeleb dataset across five test scenarios, comparing it with the state-of-the-art embedding-based approach. While Neural Scoring achieves performance comparable to the state of the art under the benchmark (clean) test condition, it demonstrates a remarkable advantage in the four complex scenarios, achieving an overall 64.53% reduction in equal error rate (EER) compared to the baseline.
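
The embedding-based baseline and the EER metric mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names `cosine_score` and `equal_error_rate` and all vectors and scores below are hypothetical, and the EER is approximated by a simple threshold sweep over the observed scores. Neural Scoring itself is not sketched here, since the abstract does not give the architecture details.

```python
import math


def cosine_score(enroll, test):
    """Cosine similarity between two fixed-length speaker vectors."""
    dot = sum(a * b for a, b in zip(enroll, test))
    norm_enroll = math.sqrt(sum(a * a for a in enroll))
    norm_test = math.sqrt(sum(b * b for b in test))
    return dot / (norm_enroll * norm_test)


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over the observed scores.

    A trial is accepted when its score is >= the threshold. The EER is
    taken at the threshold where the false acceptance rate (FAR) and the
    false rejection rate (FRR) are closest, as their average.
    """
    best_gap, eer = float("inf"), None
    for thr in sorted(set(target_scores) | set(nontarget_scores)):
        frr = sum(s < thr for s in target_scores) / len(target_scores)
        far = sum(s >= thr for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Hypothetical toy embeddings (real systems use much higher dimensions).
enroll_vec = [1.0, 0.0]
same_speaker_vec = [1.0, 0.0]
other_speaker_vec = [0.0, 1.0]

same_score = cosine_score(enroll_vec, same_speaker_vec)
other_score = cosine_score(enroll_vec, other_speaker_vec)

# Hypothetical trial scores: targets are same-speaker trials.
eer_separable = equal_error_rate([0.9, 0.8, 0.7], [0.1, 0.2, 0.3])
eer_overlapping = equal_error_rate([0.9, 0.4], [0.6, 0.1])
```

In this sketch a lower EER means the target and non-target score distributions are better separated; the abstract's reported reduction refers to this metric, measured on VoxCeleb rather than on toy data like the above.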
