Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sung Min Ban

Phase-Aware Spoof Speech Detection Based on Res2Net with Phase Network

Mar 21, 2022

Juntae Kim, Sung Min Ban

Figure 1 for Phase-Aware Spoof Speech Detection Based on Res2Net with Phase Network

Figure 2 for Phase-Aware Spoof Speech Detection Based on Res2Net with Phase Network

Figure 3 for Phase-Aware Spoof Speech Detection Based on Res2Net with Phase Network

Figure 4 for Phase-Aware Spoof Speech Detection Based on Res2Net with Phase Network

Abstract:The spoof speech detection (SSD) is the essential countermeasure for automatic speaker verification systems. Although SSD with magnitude features in the frequency domain has shown promising results, the phase information also can be important to capture the artefacts of certain types of spoofing attacks. Thus, both magnitude and phase features must be considered to ensure the generalization ability to diverse types of spoofing attacks. In this paper, we investigate the failure reason of feature-level fusion of the previous works through the entropy analysis from which we found that the randomness difference between magnitude and phase features is large, which can interrupt the feature-level fusion via backend neural network; thus, we propose a phase network to reduce that difference. Our SSD system: phase network equipped Res2Net achieved significant performance improvement, specifically in the spoofing attack for which the phase information is considered to be important. Also, we demonstrate our SSD system in both known- and unknown-kind SSD scenarios for practical applications.

Via

Access Paper or Ask Questions