Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Mar 26, 2024

Francesca Ronchini, Luca Comanducci, Fabio Antonacci

Figure 1 for Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Figure 2 for Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Figure 3 for Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Figure 4 for Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Share this with someone who'll enjoy it:

Abstract:In the past few years, text-to-audio models have emerged as a significant advancement in automatic audio generation. Although they represent impressive technological progress, the effectiveness of their use in the development of audio applications remains uncertain. This paper aims to investigate these aspects, specifically focusing on the task of classification of environmental sounds. This study analyzes the performance of two different environmental classification systems when data generated from text-to-audio models is used for training. Two cases are considered: a) when the training dataset is augmented by data coming from two different text-to-audio models; and b) when the training dataset consists solely of synthetic audio generated. In both cases, the performance of the classification task is tested on real data. Results indicate that text-to-audio models are effective for dataset augmentation, whereas the performance of the models drops when relying on only generated audio.

* Submitted to EUSIPCO 2024

View paper on

Share this with someone who'll enjoy it:

Title:Synthesizing Soundscapes: Leveraging Text-to-Audio Models for Environmental Sound Classification

Paper and Code