Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rodrigo Diaz

Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates

May 09, 2025

Rodrigo Diaz, Mark Sandler

Abstract:Modal methods for simulating vibrations of strings, membranes, and plates are widely used in acoustics and physically informed audio synthesis. However, traditional implementations, particularly for non-linear models like the von K\'arm\'an plate, are computationally demanding and lack differentiability, limiting inverse modelling and real-time applications. We introduce a fast, differentiable, GPU-accelerated modal framework built with the JAX library, providing efficient simulations and enabling gradient-based inverse modelling. Benchmarks show that our approach significantly outperforms CPU and GPU-based implementations, particularly for simulations with many modes. Inverse modelling experiments demonstrate that our approach can recover physical parameters, including tension, stiffness, and geometry, from both synthetic and experimental data. Although fitting physical parameters is more sensitive to initialisation compared to other methods, it provides greater interpretability and more compact parameterisation. The code is released as open source to support future research and applications in differentiable physical modelling and sound synthesis.

* accepted to DAFx 2025

Via

Access Paper or Ask Questions

Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman based Deep Learning Methods

Aug 29, 2024

Rodrigo Diaz, Carlos De La Vega Martin, Mark Sandler

Figure 1 for Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman based Deep Learning Methods

Figure 2 for Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman based Deep Learning Methods

Figure 3 for Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman based Deep Learning Methods

Figure 4 for Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman based Deep Learning Methods

Abstract:This paper presents an examination of State Space Models (SSM) and Koopman-based deep learning methods for modelling the dynamics of both linear and non-linear stiff strings. Through experiments with datasets generated under different initial conditions and sample rates, we assess the capacity of these models to accurately model the complex behaviours observed in string dynamics. Our findings indicate that our proposed Koopman-based model performs as well as or better than other existing approaches in non-linear cases for long-sequence modelling. We inform the design of these architectures with the structure of the problems at hand. Although challenges remain in extending model predictions beyond the training horizon (i.e., extrapolation), the focus of our investigation lies in the models' ability to generalise across different initial conditions within the training time interval. This research contributes insights into the physical modelling of dynamical systems (in particular those addressing musical acoustics) by offering a comparative overview of these and previous methods and introducing innovative strategies for model improvement. Our results highlight the efficacy of these models in simulating non-linear dynamics and emphasise their wide-ranging applicability in accurately modelling dynamical systems over extended sequences.

* Accepted to DAFx2024

Via

Access Paper or Ask Questions

Pipeline for recording datasets and running neural networks on the Bela embedded hardware platform

Jun 20, 2023

Teresa Pelinski, Rodrigo Diaz, Adán L. Benito Temprano, Andrew McPherson

Figure 1 for Pipeline for recording datasets and running neural networks on the Bela embedded hardware platform

Figure 2 for Pipeline for recording datasets and running neural networks on the Bela embedded hardware platform

Figure 3 for Pipeline for recording datasets and running neural networks on the Bela embedded hardware platform

Figure 4 for Pipeline for recording datasets and running neural networks on the Bela embedded hardware platform

Abstract:Deploying deep learning models on embedded devices is an arduous task: oftentimes, there exist no platform-specific instructions, and compilation times can be considerably large due to the limited computational resources available on-device. Moreover, many music-making applications demand real-time inference. Embedded hardware platforms for audio, such as Bela, offer an entry point for beginners into physical audio computing; however, the need for cross-compilation environments and low-level software development tools for deploying embedded deep learning models imposes high entry barriers on non-expert users. We present a pipeline for deploying neural networks in the Bela embedded hardware platform. In our pipeline, we include a tool to record a multichannel dataset of sensor signals. Additionally, we provide a dockerised cross-compilation environment for faster compilation. With this pipeline, we aim to provide a template for programmers and makers to prototype and experiment with neural networks for real-time embedded musical applications.

Via

Access Paper or Ask Questions

Interactive Neural Resonators

May 24, 2023

Rodrigo Diaz, Charalampos Saitis, Mark Sandler

Figure 1 for Interactive Neural Resonators

Figure 2 for Interactive Neural Resonators

Figure 3 for Interactive Neural Resonators

Figure 4 for Interactive Neural Resonators

Abstract:In this work, we propose a method for the controllable synthesis of real-time contact sounds using neural resonators. Previous works have used physically inspired statistical methods and physical modelling for object materials and excitation signals. Our method incorporates differentiable second-order resonators and estimates their coefficients using a neural network that is conditioned on physical parameters. This allows for interactive dynamic control and the generation of novel sounds in an intuitive manner. We demonstrate the practical implementation of our method and explore its potential creative applications.

Via

Access Paper or Ask Questions

Multi-View Mesh Reconstruction with Neural Deferred Shading

Dec 08, 2022

Markus Worchel, Rodrigo Diaz, Weiwen Hu, Oliver Schreer, Ingo Feldmann, Peter Eisert

Figure 1 for Multi-View Mesh Reconstruction with Neural Deferred Shading

Figure 2 for Multi-View Mesh Reconstruction with Neural Deferred Shading

Figure 3 for Multi-View Mesh Reconstruction with Neural Deferred Shading

Figure 4 for Multi-View Mesh Reconstruction with Neural Deferred Shading

Abstract:We propose an analysis-by-synthesis method for fast multi-view 3D reconstruction of opaque objects with arbitrary materials and illumination. State-of-the-art methods use both neural surface representations and neural rendering. While flexible, neural surface representations are a significant bottleneck in optimization runtime. Instead, we represent surfaces as triangle meshes and build a differentiable rendering pipeline around triangle rasterization and neural shading. The renderer is used in a gradient descent optimization where both a triangle mesh and a neural shader are jointly optimized to reproduce the multi-view images. We evaluate our method on a public 3D reconstruction dataset and show that it can match the reconstruction accuracy of traditional baselines and neural approaches while surpassing them in optimization runtime. Additionally, we investigate the shader and find that it learns an interpretable representation of appearance, enabling applications such as 3D material editing.

* CVPR 2022, project page: https://fraunhoferhhi.github.io/neural-deferred-shading/

Via

Access Paper or Ask Questions

Rigid-Body Sound Synthesis with Differentiable Modal Resonators

Oct 28, 2022

Rodrigo Diaz, Ben Hayes, Charalampos Saitis, György Fazekas, Mark Sandler

Figure 1 for Rigid-Body Sound Synthesis with Differentiable Modal Resonators

Figure 2 for Rigid-Body Sound Synthesis with Differentiable Modal Resonators

Figure 3 for Rigid-Body Sound Synthesis with Differentiable Modal Resonators

Figure 4 for Rigid-Body Sound Synthesis with Differentiable Modal Resonators

Abstract:Physical models of rigid bodies are used for sound synthesis in applications from virtual environments to music production. Traditional methods such as modal synthesis often rely on computationally expensive numerical solvers, while recent deep learning approaches are limited by post-processing of their results. In this work we present a novel end-to-end framework for training a deep neural network to generate modal resonators for a given 2D shape and material, using a bank of differentiable IIR filters. We demonstrate our method on a dataset of synthetic objects, but train our model using an audio-domain objective, paving the way for physically-informed synthesisers to be learned directly from recordings of real-world objects.

* 5 pages

Via

Access Paper or Ask Questions