Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:VoiceBench: Benchmarking LLM-Based Voice Assistants

Oct 22, 2024

Yiming Chen, Xianghu Yue, Chen Zhang, Xiaoxue Gao, Robby T. Tan, Haizhou Li

Figure 1 for VoiceBench: Benchmarking LLM-Based Voice Assistants

Figure 2 for VoiceBench: Benchmarking LLM-Based Voice Assistants

Figure 3 for VoiceBench: Benchmarking LLM-Based Voice Assistants

Figure 4 for VoiceBench: Benchmarking LLM-Based Voice Assistants

Share this with someone who'll enjoy it:

Abstract:Building on the success of large language models (LLMs), recent advancements such as GPT-4o have enabled real-time speech interactions through LLM-based voice assistants, offering a significantly improved user experience compared to traditional text-based interactions. However, the absence of benchmarks designed to evaluate these speech interaction capabilities has hindered progress of LLM-based voice assistants development. Current evaluations focus primarily on automatic speech recognition (ASR) or general knowledge evaluation with clean speeches, neglecting the more intricate, real-world scenarios that involve diverse speaker characteristics, environmental and content factors. To address this, we introduce VoiceBench, the first benchmark designed to provide a multi-faceted evaluation of LLM-based voice assistants. VoiceBench also includes both real and synthetic spoken instructions that incorporate the above three key real-world variations. Extensive experiments reveal the limitations of current LLM-based voice assistant models and offer valuable insights for future research and development in this field.

* Work in progress. Data is available at https://github.com/MatthewCYM/VoiceBench

View paper on

Share this with someone who'll enjoy it:

Title:VoiceBench: Benchmarking LLM-Based Voice Assistants

Paper and Code