Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Piyush Papreja

Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Apr 04, 2021

Loren Lugosch, Piyush Papreja, Mirco Ravanelli, Abdelwahab Heba, Titouan Parcollet

Figure 1 for Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Figure 2 for Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Figure 3 for Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Figure 4 for Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Abstract:This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with a number of ASR-based and end-to-end baseline models, the code for which has been made available as part of the SpeechBrain toolkit.

Via

Access Paper or Ask Questions

Representation, Exploration and Recommendation of Music Playlists

Jul 01, 2019

Piyush Papreja, Hemanth Venkateswara, Sethuraman Panchanathan

Figure 1 for Representation, Exploration and Recommendation of Music Playlists

Figure 2 for Representation, Exploration and Recommendation of Music Playlists

Figure 3 for Representation, Exploration and Recommendation of Music Playlists

Figure 4 for Representation, Exploration and Recommendation of Music Playlists

Abstract:Playlists have become a significant part of our listening experience because of the digital cloud-based services such as Spotify, Pandora, Apple Music. Owing to the meteoric rise in the usage of playlists, recommending playlists is crucial to music services today. Although there has been a lot of work done in playlist prediction, the area of playlist representation hasn't received that level of attention. Over the last few years, sequence-to-sequence models, especially in the field of natural language processing, have shown the effectiveness of learned embeddings in capturing the semantic characteristics of sequences. We can apply similar concepts to music to learn fixed length representations for playlists and use those representations for downstream tasks such as playlist discovery, browsing, and recommendation. In this work, we formulate the problem of learning a fixed-length playlist representation in an unsupervised manner, using Sequence-to-sequence (Seq2seq) models, interpreting playlists as sentences and songs as words. We compare our model with two other encoding architectures for baseline comparison. We evaluate our work using the suite of tasks commonly used for assessing sentence embeddings, along with a few additional tasks pertaining to music, and a recommendation task to study the traits captured by the playlist embeddings and their effectiveness for the purpose of music recommendation.

Via

Access Paper or Ask Questions