Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chandan Reddy

Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Oct 08, 2021

Xavier Gitiaux, Aditya Khant, Chandan Reddy, Jayant Gupchup, Ross Cutler

Figure 1 for Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Figure 2 for Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Figure 3 for Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Figure 4 for Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Abstract:Noise suppression models running in production environments are commonly trained on publicly available datasets. However, this approach leads to regressions in production environments due to the lack of training/testing on representative customer data. Moreover, due to privacy reasons, developers cannot listen to customer content. This `ears-off' situation motivates augmenting existing datasets in a privacy-preserving manner. In this paper, we present Aura, a solution to make existing noise suppression test sets more challenging and diverse while limiting the sampling budget. Aura is `ears-off' because it relies on a feature extractor and a metric of speech quality, DNSMOS P.835, both pre-trained on data obtained from public sources. As an application of \aura, we augment a current benchmark test set in noise suppression by sampling audio files from a new batch of data of 20K clean speech clips from Librivox mixed with noise clips obtained from AudioSet. Aura makes the existing benchmark test set harder by 100% in DNSMOS P.835, a 26 improvement in Spearman's rank correlation coefficient (SRCC) compared to random sampling and, identifies 73% out-of-distribution samples to augment the test set.

Via

Access Paper or Ask Questions