Kamil Akesbi

Music Augmentation and Denoising For Peak-Based Audio Fingerprinting

Oct 29, 2023

Kamil Akesbi, Dorian Desblancs, Benjamin Martin

Abstract: Audio fingerprinting is a well-established solution for song identification from short recording excerpts. Popular methods rely on the extraction of sparse representations, generally spectral peaks, and have proven to be accurate, fast, and scalable to large collections. However, real-world applications of audio identification often happen in noisy environments, which can cause these systems to fail. In this work, we tackle this problem by introducing and releasing a new audio augmentation pipeline that adds noise to music snippets in a realistic way, by stochastically mimicking real-world scenarios. We then propose and release a deep learning model that removes noisy components from spectrograms in order to improve peak-based fingerprinting systems' accuracy. We show that the addition of our model improves the identification performance of commonly used audio fingerprinting systems, even under noisy conditions.
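One basic building block of any noise augmentation of this kind is mixing a background recording into a music snippet at a chosen signal-to-noise ratio. The following is only a minimal sketch of that single step, not the released pipeline: the function name `mix_at_snr` and its parameters are hypothetical, and the real pipeline is described as stochastic and more elaborate than this.

```python
import numpy as np


def mix_at_snr(music, noise, snr_db):
    """Add `noise` to `music` so the mixture has the requested SNR in dB.

    Hypothetical helper for illustration; both inputs are 1-D arrays of
    equal length holding audio samples.
    """
    music = np.asarray(music, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    if music.shape != noise.shape:
        raise ValueError("music and noise must have the same shape")

    # Average power of each signal.
    p_music = np.mean(music ** 2)
    p_noise = np.mean(noise ** 2)
    if p_noise == 0.0:
        raise ValueError("noise must not be silent")

    # SNR_dB = 10 * log10(p_music / (gain^2 * p_noise)), solved for gain.
    gain = np.sqrt(p_music / (p_noise * 10.0 ** (snr_db / 10.0)))
    return music + gain * noise


# Usage: a 440 Hz tone mixed with white noise at 5 dB SNR.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
music = np.sin(2.0 * np.pi * 440.0 * t)
noise = rng.standard_normal(t.shape)
noisy = mix_at_snr(music, noise, snr_db=5.0)
```

Drawing `snr_db` at random for each snippet would be one simple way to make such a step stochastic; whether the released pipeline does this is not stated in the abstract.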

Audio Denoising for Robust Audio Fingerprinting

Dec 21, 2022

Kamil Akesbi

Abstract: Music discovery services let users identify songs from short mobile recordings. These solutions are often based on Audio Fingerprinting (AFP), and rely more specifically on the extraction of spectral peaks in order to be robust to a number of distortions. Few works have studied the robustness of these algorithms to background noise captured in real environments. In particular, AFP systems still struggle when the signal-to-noise ratio is low, i.e., when the background noise is strong. In this project, we tackle this problem with Deep Learning (DL). We test a new hybrid strategy which consists of inserting a denoising DL model in front of a peak-based AFP algorithm. We simulate noisy music recordings using a realistic data augmentation pipeline, and train a DL model to denoise them. The denoising model limits the impact of background noise on the AFP system's extracted peaks, improving its robustness to noise. We further propose a novel loss function to adapt the DL model to the considered AFP system, increasing its precision in terms of retrieved spectral peaks. To the best of our knowledge, this hybrid strategy has not been tested before.
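To make "extracted peaks" and "precision in terms of retrieved spectral peaks" concrete, here is a minimal sketch of a generic local-maximum peak picker on a magnitude spectrogram, together with a simple precision measure against reference peaks. Both functions (`spectral_peaks`, `peak_precision`) and their parameters are hypothetical illustrations; the thesis's actual AFP system, matching rule, and loss function are not specified in the abstract.

```python
import numpy as np


def spectral_peaks(spec, neighborhood=1, threshold=0.0):
    """Return (freq_bin, time_frame) indices of strict local maxima.

    A bin is a peak if it exceeds `threshold` and is strictly greater
    than every other bin within `neighborhood` steps in both axes.
    """
    spec = np.asarray(spec, dtype=np.float64)
    n_freq, n_time = spec.shape
    r = neighborhood
    # Pad with -inf so border bins are compared only to real neighbours.
    padded = np.pad(spec, r, mode="constant", constant_values=-np.inf)
    is_peak = spec > threshold
    for df in range(-r, r + 1):
        for dt in range(-r, r + 1):
            if df == 0 and dt == 0:
                continue
            shifted = padded[r + df:r + df + n_freq, r + dt:r + dt + n_time]
            is_peak &= spec > shifted
    return np.argwhere(is_peak)


def peak_precision(predicted, reference):
    """Fraction of predicted peaks that also appear in the reference set."""
    pred = {tuple(p) for p in np.asarray(predicted).tolist()}
    ref = {tuple(p) for p in np.asarray(reference).tolist()}
    if not pred:
        return 0.0
    return len(pred & ref) / len(pred)


# Usage: a toy spectrogram with two isolated peaks.
clean = np.zeros((5, 6))
clean[2, 3] = 1.0
clean[0, 0] = 0.5
clean_peaks = spectral_peaks(clean, neighborhood=1, threshold=0.1)

# A "noisy" version with one spurious extra peak.
noisy = clean.copy()
noisy[4, 5] = 0.8
noisy_peaks = spectral_peaks(noisy, neighborhood=1, threshold=0.1)
precision = peak_precision(noisy_peaks, clean_peaks)
```

In this toy setup, a denoiser would aim to suppress the spurious peak so that the peaks extracted from its output match those of the clean recording more closely.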

* 63 pages, master's thesis
