Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gloria Dal Santo

FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio Processing

Sep 13, 2024

Gloria Dal Santo, Gian Marco De Bortoli, Karolina Prawda, Sebastian J. Schlecht, Vesa Välimäki

Figure 1 for FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio Processing

Figure 2 for FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio Processing

Figure 3 for FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio Processing

Figure 4 for FLAMO: An Open-Source Library for Frequency-Domain Differentiable Audio Processing

Abstract:We present FLAMO, a Frequency-sampling Library for Audio-Module Optimization designed to implement and optimize differentiable linear time-invariant audio systems. The library is open-source and built on the frequency-sampling filter design method, allowing for the creation of differentiable modules that can be used stand-alone or within the computation graph of neural networks, simplifying the development of differentiable audio systems. It includes predefined filtering modules and auxiliary classes for constructing, training, and logging the optimized systems, all accessible through an intuitive interface. Practical application of these modules is demonstrated through two case studies: the optimization of an artificial reverberator and an active acoustics system for improved response smoothness.

Via

Access Paper or Ask Questions

Similarity Metrics For Late Reverberation

Aug 27, 2024

Gloria Dal Santo, Karolina Prawda, Sebastian J. Schlecht, Vesa Välimäki

Figure 1 for Similarity Metrics For Late Reverberation

Figure 2 for Similarity Metrics For Late Reverberation

Figure 3 for Similarity Metrics For Late Reverberation

Abstract:Automatic tuning of reverberation algorithms relies on the optimization of a cost function. While general audio similarity metrics are useful, they are not optimized for the specific statistical properties of reverberation in rooms. This paper presents two novel metrics for assessing the similarity of late reverberation in room impulse responses. These metrics are differentiable and can be utilized within a machine-learning framework. We compare the performance of these metrics to two popular audio metrics using a large dataset of room impulse responses encompassing various room configurations and microphone positions. The results indicate that the proposed functions based on averaged power and frequency-band energy decay outperform the baselines with the former exhibiting the most suitable profile towards the minimum. The proposed work holds promise as an improvement to the design and evaluation of reverberation similarity metrics.

Via

Access Paper or Ask Questions

Feedback Delay Network Optimization

Feb 17, 2024

Gloria Dal Santo, Karolina Prawda, Sebastian J. Schlecht, Vesa Välimäki

Figure 1 for Feedback Delay Network Optimization

Figure 2 for Feedback Delay Network Optimization

Figure 3 for Feedback Delay Network Optimization

Figure 4 for Feedback Delay Network Optimization

Abstract:A common bane of artificial reverberation algorithms is spectral coloration, typically manifesting as metallic ringing, leading to a degradation in the perceived sound quality. This paper presents an optimization framework where a differentiable feedback delay network is used to learn a set of parameters to reduce coloration iteratively. The parameters under optimization include the feedback matrix, as well as the input and output gains. The optimization objective is twofold: to maximize spectral flatness through a spectral loss while maintaining temporal density by penalizing sparseness in the parameter values. A favorable narrower distribution of modal excitation is achieved while maintaining the desired impulse response density. In a subjective assessment, the new method proves effective in reducing perceptual coloration of late reverberation. The proposed method achieves computational savings compared to the baseline while preserving its performance. The effectiveness of this work is demonstrated through two application scenarios where natural-sounding synthetic impulse responses are obtained via the introduction of attenuation filters and an optimizable scattering feedback matrix.

Via

Access Paper or Ask Questions