Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models

Nov 06, 2024

Wen-Chin Huang, Erica Cooper, Tomoki Toda

Share this with someone who'll enjoy it:

Abstract:Subjective speech quality assessment (SSQA) is critical for evaluating speech samples as perceived by human listeners. While model-based SSQA has enjoyed great success thanks to the development of deep neural networks (DNNs), generalization remains a key challenge, especially for unseen, out-of-domain data. To benchmark the generalization abilities of SSQA models, we present MOS-Bench, a diverse collection of datasets. In addition, we also introduce SHEET, an open-source toolkit containing complete recipes to conduct SSQA experiments. We provided benchmark results for MOS-Bench, and we also explored multi-dataset training to enhance generalization. Additionally, we proposed a new performance metric, best score difference/ratio, and used latent space visualizations to explain model behavior, offering valuable insights for future research.

* Submitted to Transactions on Audio, Speech and Language Processing. This work has been submitted to the IEEE for possible publication

View paper on

Share this with someone who'll enjoy it:

Title:MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models

Paper and Code