Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

Jun 25, 2022

Roshan Sharma, Tyler Vuong, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj

Figure 1 for Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

Figure 2 for Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

Figure 3 for Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

Figure 4 for Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

Share this with someone who'll enjoy it:

Abstract:This work presents a multitask approach to the simultaneous estimation of age, country of origin, and emotion given vocal burst audio for the 2022 ICML Expressive Vocalizations Challenge ExVo-MultiTask track. The method of choice utilized a combination of spectro-temporal modulation and self-supervised features, followed by an encoder-decoder network organized in a multitask paradigm. We evaluate the complementarity between the tasks posed by examining independent task-specific and joint models, and explore the relative strengths of different feature sets. We also introduce a simple score fusion mechanism to leverage the complementarity of different feature sets for this task. We find that robust data preprocessing in conjunction with score fusion over spectro-temporal receptive field and HuBERT models achieved our best ExVo-MultiTask test score of 0.412.

* Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022

View paper on

Share this with someone who'll enjoy it:

Title:Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

Paper and Code