Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Angeliki Mourgela

Exploring trends in audio mixes and masters: Insights from a dataset analysis

Dec 04, 2024

Angeliki Mourgela, Elio Quinton, Spyridon Bissas, Joshua D. Reiss, David Ronan

Figure 1 for Exploring trends in audio mixes and masters: Insights from a dataset analysis

Figure 2 for Exploring trends in audio mixes and masters: Insights from a dataset analysis

Figure 3 for Exploring trends in audio mixes and masters: Insights from a dataset analysis

Figure 4 for Exploring trends in audio mixes and masters: Insights from a dataset analysis

Abstract:We present an analysis of a dataset of audio metrics and aesthetic considerations about mixes and masters provided by the web platform MixCheck studio. The platform is designed for educational purposes, primarily targeting amateur music producers, and aimed at analysing their recordings prior to them being released. The analysis focuses on the following data points: integrated loudness, mono compatibility, presence of clipping and phase issues, compression and tonal profile across 30 user-specified genres. Both mixed (mixes) and mastered audio (masters) are included in the analysis, where mixes refer to the initial combination and balance of individual tracks, and masters refer to the final refined version optimized for distribution. Results show that loudness-related issues along with dynamics issues are the most prevalent, particularly in mastered audio. However mastered audio presents better results in compression than just mixed audio. Additionally, results show that mastered audio has a lower percentage of stereo field and phase issues.

* 11 pages, 6 figures, Presented at the AES 157th Convention October 2024, New York, USA

Via

Access Paper or Ask Questions

An automatic mixing speech enhancement system for multi-track audio

Apr 27, 2024

Xiaojing Liu, Angeliki Mourgela, Hongwei Ai, Joshua D. Reiss

Abstract:We propose a speech enhancement system for multitrack audio. The system will minimize auditory masking while allowing one to hear multiple simultaneous speakers. The system can be used in multiple communication scenarios e.g., teleconferencing, invoice gaming, and live streaming. The ITU-R BS.1387 Perceptual Evaluation of Audio Quality (PEAQ) model is used to evaluate the amount of masking in the audio signals. Different audio effects e.g., level balance, equalization, dynamic range compression, and spatialization are applied via an iterative Harmony searching algorithm that aims to minimize the masking. In the subjective listening test, the designed system can compete with mixes by professional sound engineers and outperforms mixes by existing auto-mixing systems.

* 5 pages

Via

Access Paper or Ask Questions