Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Jun 07, 2024

Junzuo Zhou, Jiangyan Yi, Tao Wang, Jianhua Tao, Ye Bai, Chu Yuan Zhang, Yong Ren, Zhengqi Wen

Figure 1 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Figure 2 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Figure 3 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Figure 4 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Share this with someone who'll enjoy it:

Abstract:Various threats posed by the progress in text-to-speech (TTS) have prompted the need to reliably trace synthesized speech. However, contemporary approaches to this task involve adding watermarks to the audio separately after generation, a process that hurts both speech quality and watermark imperceptibility. In addition, these approaches are limited in robustness and flexibility. To address these problems, we propose TraceableSpeech, a novel TTS model that directly generates watermarked speech, improving watermark imperceptibility and speech quality. Furthermore, We design the frame-wise imprinting and extraction of watermarks, achieving higher robustness against resplicing attacks and temporal flexibility in operation. Experimental results show that TraceableSpeech outperforms the strong baseline where VALL-E or HiFicodec individually uses WavMark in watermark imperceptibility, speech quality and resilience against resplicing attacks. It also can apply to speech of various durations.

* acceped by interspeech 2024

View paper on

Share this with someone who'll enjoy it:

Title:TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Paper and Code