Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jule Pohlhausen

Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation

Aug 01, 2024

Jule Pohlhausen, Francesco Nespoli, Joerg Bitzer

Abstract:Recordings in everyday life require privacy preservation of the speech content and speaker identity. This contribution explores the influence of noise and reverberation on the trade-off between privacy and utility for low-cost privacy-preserving methods feasible for edge computing. These methods compromise spectral and temporal smoothing, speaker anonymization using the McAdams coefficient, sampling with a very low sampling rate, and combinations. Privacy is assessed by automatic speech and speaker recognition, while our utility considers voice activity detection and speaker diarization. Overall, our evaluation shows that additional noise degrades the performance of all models more than reverberation. This degradation corresponds to enhanced speech privacy, while utility is less deteriorated for some methods.

* Accepted for publication at IWAENC 2024

Via

Access Paper or Ask Questions

Long-term Conversation Analysis: Exploring Utility and Privacy

Jun 28, 2023

Francesco Nespoli, Jule Pohlhausen, Patrick A. Naylor, Joerg Bitzer

Abstract:The analysis of conversations recorded in everyday life requires privacy protection. In this contribution, we explore a privacy-preserving feature extraction method based on input feature dimension reduction, spectral smoothing and the low-cost speaker anonymization technique based on McAdams coefficient. We assess the utility of the feature extraction methods with a voice activity detection and a speaker diarization system, while privacy protection is determined with a speech recognition and a speaker verification model. We show that the combination of McAdams coefficient and spectral smoothing maintains the utility while improving privacy.

* Submitted to ITG Conference on Speech Communication, 2023

Via

Access Paper or Ask Questions