Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:V2S attack: building DNN-based voice conversion from automatic speaker verification

Aug 05, 2019

Taiki Nakamura, Yuki Saito, Shinnosuke Takamichi, Yusuke Ijima, Hiroshi Saruwatari

Figure 1 for V2S attack: building DNN-based voice conversion from automatic speaker verification

Figure 2 for V2S attack: building DNN-based voice conversion from automatic speaker verification

Figure 3 for V2S attack: building DNN-based voice conversion from automatic speaker verification

Figure 4 for V2S attack: building DNN-based voice conversion from automatic speaker verification

Share this with someone who'll enjoy it:

Abstract:This paper presents a new voice impersonation attack using voice conversion (VC). Enrolling personal voices for automatic speaker verification (ASV) offers natural and flexible biometric authentication systems. Basically, the ASV systems do not include the users' voice data. However, if the ASV system is unexpectedly exposed and hacked by a malicious attacker, there is a risk that the attacker will use VC techniques to reproduce the enrolled user's voices. We name this the ``verification-to-synthesis (V2S) attack'' and propose VC training with the ASV and pre-trained automatic speech recognition (ASR) models and without the targeted speaker's voice data. The VC model reproduces the targeted speaker's individuality by deceiving the ASV model and restores phonetic property of an input voice by matching phonetic posteriorgrams predicted by the ASR model. The experimental evaluation compares converted voices between the proposed method that does not use the targeted speaker's voice data and the standard VC that uses the data. The experimental results demonstrate that the proposed method performs comparably to the existing VC methods that trained using a very small amount of parallel voice data.

* 5 pages, 2 figures, accepted for The 10th ISCA Speech Synthesis Workshop (SSW10)

View paper on

Share this with someone who'll enjoy it:

Title:V2S attack: building DNN-based voice conversion from automatic speaker verification

Paper and Code