Adrien Bardet

Universal audio synthesizer control with normalizing flows

Jul 01, 2019

Philippe Esling, Naotake Masuda, Adrien Bardet, Romeo Despres, Axel Chemla--Romeu-Santos

Abstract: The ubiquity of sound synthesizers has reshaped music production and even entirely defined new music genres. However, the increasing complexity and number of parameters in modern synthesizers make them harder to master. Hence, methods that make it easy to create and explore sounds with synthesizers are a crucial need. Here, we introduce a novel formulation of audio synthesizer control. We formalize it as finding an organized latent audio space that represents the capabilities of a synthesizer, while constructing an invertible mapping to the space of its parameters. Using this formulation, we show that we can simultaneously address automatic parameter inference, macro-control learning and audio-based preset exploration within a single model. To solve this new formulation, we rely on Variational Auto-Encoders (VAE) and Normalizing Flows (NF) to organize and map the respective auditory and parameter spaces. We introduce disentangling flows, which perform the invertible mapping between separate latent spaces while steering the organization of some latent dimensions to match target variation factors by splitting the objective as partial density evaluation. We evaluate our proposal against a large set of baseline models and show its superiority in both parameter inference and audio reconstruction. We also show that the model disentangles the major factors of audio variation as latent dimensions, which can be directly used as macro-parameters, and that it is able to learn semantic controls of a synthesizer by smoothly mapping to its parameters. Finally, we discuss the use of our model in creative applications and its real-time implementation in Ableton Live.

* DaFX 2019
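
To make the idea of an invertible mapping more concrete, here is a minimal sketch of one affine coupling step, a common building block of normalizing flows. It is not the disentangling flow from the paper: the conditioner `_scale_and_shift`, the function names and the toy input are all assumptions made for illustration. The first half of the vector passes through unchanged and parameterizes a scale and shift applied to the second half, so the inverse and the log-determinant of the Jacobian are both available in closed form.

```python
import math


def _scale_and_shift(x_a):
    """Toy conditioner (hypothetical): derive a log-scale s and shift t from x_a."""
    s = math.tanh(sum(x_a))             # bounded log-scale keeps exp(s) well behaved
    t = 0.5 * sum(v * v for v in x_a)   # arbitrary nonlinear shift
    return s, t


def coupling_forward(x):
    """Map x -> z; return z and log|det Jacobian| of the transform."""
    half = len(x) // 2
    x_a, x_b = x[:half], x[half:]
    s, t = _scale_and_shift(x_a)
    z_b = [v * math.exp(s) + t for v in x_b]
    # The Jacobian is triangular with exp(s) on the diagonal for each x_b entry.
    log_det = s * len(x_b)
    return x_a + z_b, log_det


def coupling_inverse(z):
    """Map z -> x by undoing the scale and shift; x_a is recovered unchanged."""
    half = len(z) // 2
    z_a, z_b = z[:half], z[half:]
    s, t = _scale_and_shift(z_a)
    x_b = [(v - t) * math.exp(-s) for v in z_b]
    return z_a + x_b


x = [0.3, -1.2, 0.8, 2.0]   # made-up 4-dimensional input
z, log_det = coupling_forward(x)
x_rec = coupling_inverse(z)
```

Because the conditioner is only ever evaluated on the untouched half, it can be an arbitrary (non-invertible) function, which is what lets such layers be stacked into expressive yet exactly invertible maps.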

LIUM-CVC Submissions for WMT18 Multimodal Translation Task

Sep 01, 2018

Ozan Caglayan, Adrien Bardet, Fethi Bougares, Loïc Barrault, Kai Wang, Marc Masana, Luis Herranz, Joost van de Weijer

Abstract: This paper describes the multimodal Neural Machine Translation systems developed by LIUM and CVC for the WMT18 Shared Task on Multimodal Translation. This year we propose several modifications to our previous multimodal attention architecture in order to better integrate convolutional features and refine them using encoder-side information. Our final constrained submissions ranked first for English-French and second for English-German among the constrained submissions according to the automatic evaluation metric METEOR.

* WMT2018

LIUM Machine Translation Systems for WMT17 News Translation Task

Jul 14, 2017

Mercedes García-Martínez, Ozan Caglayan, Walid Aransa, Adrien Bardet, Fethi Bougares, Loïc Barrault

Abstract: This paper describes the LIUM submissions to the WMT17 News Translation Task for the English-German, English-Turkish, English-Czech and English-Latvian language pairs. We train BPE-based attentive Neural Machine Translation systems with and without factored outputs using the open-source nmtpy framework. Competitive scores were obtained by ensembling various systems and exploiting the availability of target monolingual corpora for back-translation. The impact of back-translation quantity and quality is also analyzed for English-Turkish, where our post-deadline submission surpassed the best entry by +1.6 BLEU.

* News Translation Task System Description paper for WMT17

LIUM-CVC Submissions for WMT17 Multimodal Translation Task

Jul 14, 2017

Ozan Caglayan, Walid Aransa, Adrien Bardet, Mercedes García-Martínez, Fethi Bougares, Loïc Barrault, Marc Masana, Luis Herranz, Joost van de Weijer

Abstract: This paper describes the monomodal and multimodal Neural Machine Translation systems developed by LIUM and CVC for the WMT17 Shared Task on Multimodal Translation. We mainly explored two multimodal architectures in which either global visual features or convolutional feature maps are integrated in order to benefit from visual context. Our final systems ranked first for both the En-De and En-Fr language pairs according to the automatic evaluation metrics METEOR and BLEU.

* MMT System Description Paper for WMT17

NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems

Jun 01, 2017

Ozan Caglayan, Mercedes García-Martínez, Adrien Bardet, Walid Aransa, Fethi Bougares, Loïc Barrault

Abstract: In this paper, we present nmtpy, a flexible Python toolkit based on Theano for training Neural Machine Translation and other neural sequence-to-sequence architectures. nmtpy decouples the specification of a network from the training and inference utilities, which simplifies adding a new architecture and reduces the amount of boilerplate code to be written. nmtpy has been used for LIUM's top-ranked submissions to the WMT Multimodal Machine Translation and News Translation tasks in 2016 and 2017.

* 10 pages, 3 figures
