Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Babe Sultana

BANSpEmo: A Bangla Emotional Speech Recognition Dataset

Dec 21, 2023

Md Gulzar Hussain, Mahmuda Rahman, Babe Sultana, Ye Shiren

Figure 1 for BANSpEmo: A Bangla Emotional Speech Recognition Dataset

Figure 2 for BANSpEmo: A Bangla Emotional Speech Recognition Dataset

Figure 3 for BANSpEmo: A Bangla Emotional Speech Recognition Dataset

Figure 4 for BANSpEmo: A Bangla Emotional Speech Recognition Dataset

Abstract:In the field of audio and speech analysis, the ability to identify emotions from acoustic signals is essential. Human-computer interaction (HCI) and behavioural analysis are only a few of the many areas where the capacity to distinguish emotions from speech signals has an extensive range of applications. Here, we are introducing BanSpEmo, a corpus of emotional speech that only consists of audio recordings and has been created specifically for the Bangla language. This corpus contains 792 audio recordings over a duration of more than 1 hour and 23 minutes. 22 native speakers took part in the recording of two sets of sentences that represent the six desired emotions. The data set consists of 12 Bangla sentences which are uttered in 6 emotions as Disgust, Happy, Sad, Surprised, Anger, and Fear. This corpus is not also gender balanced. Ten individuals who either have experience in related field or have acting experience took part in the assessment of this corpus. It has a balanced number of audio recordings in each emotion class. BanSpEmo can be considered as a useful resource to promote emotion and speech recognition research and related applications in the Bangla language. The dataset can be found here: https://data.mendeley.com/datasets/rdwn4bs5ky and might be employed for academic research.

Via

Access Paper or Ask Questions