Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

May 06, 2022

Yuan Gong, Jin Yu, James Glass

Figure 1 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Figure 2 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Figure 3 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Figure 4 for Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Share this with someone who'll enjoy it:

Abstract:Recognizing human non-speech vocalizations is an important task and has broad applications such as automatic sound transcription and health condition monitoring. However, existing datasets have a relatively small number of vocal sound samples or noisy labels. As a consequence, state-of-the-art audio event classification models may not perform well in detecting human vocal sounds. To support research on building robust and accurate vocal sound recognition, we have created a VocalSound dataset consisting of over 21,000 crowdsourced recordings of laughter, sighs, coughs, throat clearing, sneezes, and sniffs from 3,365 unique subjects. Experiments show that the vocal sound recognition performance of a model can be significantly improved by 41.9% by adding VocalSound dataset to an existing dataset as training material. In addition, different from previous datasets, the VocalSound dataset contains meta information such as speaker age, gender, native language, country, and health condition.

* Accepted at ICASSP 2022. Dataset and code at https://github.com/YuanGongND/vocalsound Interactive Colab demo at https://colab.research.google.com/github/YuanGongND/vocalsound/blob/main/colab/VocalSound.ipynb

View paper on

Share this with someone who'll enjoy it:

Title:Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition

Paper and Code