Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Soumya Sai Vanka

Diff-MSTC: A Mixing Style Transfer Prototype for Cubase

Nov 10, 2024

Soumya Sai Vanka, Lennart Hannink, Jean-Baptiste Rolland, George Fazekas

Abstract:In our demo, participants are invited to explore the Diff-MSTC prototype, which integrates the Diff-MST model into Steinberg's digital audio workstation (DAW), Cubase. Diff-MST, a deep learning model for mixing style transfer, forecasts mixing console parameters for tracks using a reference song. The system processes up to 20 raw tracks along with a reference song to predict mixing console parameters that can be used to create an initial mix. Users have the option to manually adjust these parameters further for greater control. In contrast to earlier deep learning systems that are limited to research ideas, Diff-MSTC is a first-of-its-kind prototype integrated into a DAW. This integration facilitates mixing decisions on multitracks and lets users input context through a reference song, followed by fine-tuning of audio effects in a traditional manner.

* Presented at 2024 International Society for Music Information Retrieval

Via

Access Paper or Ask Questions

Diff-MST: Differentiable Mixing Style Transfer

Jul 11, 2024

Soumya Sai Vanka, Christian Steinmetz, Jean-Baptiste Rolland, Joshua Reiss, George Fazekas

Abstract:Mixing style transfer automates the generation of a multitrack mix for a given set of tracks by inferring production attributes from a reference song. However, existing systems for mixing style transfer are limited in that they often operate only on a fixed number of tracks, introduce artifacts, and produce mixes in an end-to-end fashion, without grounding in traditional audio effects, prohibiting interpretability and controllability. To overcome these challenges, we introduce Diff-MST, a framework comprising a differentiable mixing console, a transformer controller, and an audio production style loss function. By inputting raw tracks and a reference song, our model estimates control parameters for audio effects within a differentiable mixing console, producing high-quality mixes and enabling post-hoc adjustments. Moreover, our architecture supports an arbitrary number of input tracks without source labelling, enabling real-world applications. We evaluate our model's performance against robust baselines and showcase the effectiveness of our approach, architectural design, tailored audio production style loss, and innovative training methodology for the given task.

* Accepted to be published at the Proceedings of the 25th International Society for Music Information Retrieval Conference 2024

Via

Access Paper or Ask Questions

The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers

Sep 08, 2023

Soumya Sai Vanka, Maryam Safi, Jean-Baptiste Rolland, György Fazekas

Figure 1 for The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers

Figure 2 for The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers

Figure 3 for The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers

Figure 4 for The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers

Abstract:Effective music mixing requires technical and creative finesse, but clear communication with the client is crucial. The mixing engineer must grasp the client's expectations, and preferences, and collaborate to achieve the desired sound. The tacit agreement for the desired sound of the mix is often established using guides like reference songs and demo mixes exchanged between the artist and the engineer and sometimes verbalised using semantic terms. This paper presents the findings of a two-phased exploratory study aimed at understanding how professional mixing engineers interact with clients and use their feedback to guide the mixing process. For phase one, semi-structured interviews were conducted with five mixing engineers with the aim of gathering insights about their communication strategies, creative processes, and decision-making criteria. Based on the inferences from these interviews, an online questionnaire was designed and administered to a larger group of 22 mixing engineers during the second phase. The results of this study shed light on the importance of collaboration, empathy, and intention in the mixing process, and can inform the development of smart multi-track mixing systems that better support these practices. By highlighting the significance of these findings, this paper contributes to the growing body of research on the collaborative nature of music production and provides actionable recommendations for the design and implementation of innovative mixing tools.

* Submitted to Journal of Audio Engineering Society in July 2023. Awaiting reviews and acceptance notifications

Via

Access Paper or Ask Questions

Adoption of AI Technology in the Music Mixing Workflow: An Investigation

Apr 06, 2023

Soumya Sai Vanka, Maryam Safi, Jean-Baptiste Rolland, George Fazekas

Figure 1 for Adoption of AI Technology in the Music Mixing Workflow: An Investigation

Figure 2 for Adoption of AI Technology in the Music Mixing Workflow: An Investigation

Figure 3 for Adoption of AI Technology in the Music Mixing Workflow: An Investigation

Figure 4 for Adoption of AI Technology in the Music Mixing Workflow: An Investigation

Abstract:The integration of artificial intelligence (AI) technology in the music industry is driving a significant change in the way music is being composed, produced and mixed. This study investigates the current state of AI in the mixing workflows and its adoption by different user groups. Through semi-structured interviews, a questionnaire-based study, and analyzing web forums, the study confirms three user groups comprising amateurs, pro-ams, and professionals. Our findings show that while AI mixing tools can simplify the process and provide decent results for amateurs, pro-ams seek precise control and customization options, while professionals desire control and customization options in addition to assistive and collaborative technologies. The study provides strategies for designing effective AI mixing tools for different user groups and outlines future directions.

* To be published at the 154th AES Europe Convention, 2023 in Helsinki, Finland

Via

Access Paper or Ask Questions