Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sumin Shin

Music2Video: Automatic Generation of Music Video with fusion of audio and text

Jan 11, 2022

Joel Jang, Sumin Shin, Yoonjeon Kim

Figure 1 for Music2Video: Automatic Generation of Music Video with fusion of audio and text

Figure 2 for Music2Video: Automatic Generation of Music Video with fusion of audio and text

Figure 3 for Music2Video: Automatic Generation of Music Video with fusion of audio and text

Abstract:Creation of images using generative adversarial networks has been widely adapted into multi-modal regime with the advent of multi-modal representation models pre-trained on large corpus. Various modalities sharing a common representation space could be utilized to guide the generative models to create images from text or even from audio source. Departing from the previous methods that solely rely on either text or audio, we exploit the expressiveness of both modality. Based on the fusion of text and audio, we create video whose content is consistent with the distinct modalities that are provided. A simple approach to automatically segment the video into variable length intervals and maintain time consistency in generated video is part of our method. Our proposed framework for generating music video shows promising results in application level where users can interactively feed in music source and text source to create artistic music videos. Our code is available at https://github.com/joeljang/music2video.

Via

Access Paper or Ask Questions

FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Jul 01, 2021

Tehrim Yoon, Sumin Shin, Sung Ju Hwang, Eunho Yang

Figure 1 for FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Figure 2 for FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Figure 3 for FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Figure 4 for FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Abstract:Federated learning (FL) allows edge devices to collectively learn a model without directly sharing data within each device, thus preserving privacy and eliminating the need to store data globally. While there are promising results under the assumption of independent and identically distributed (iid) local data, current state-of-the-art algorithms suffer from performance degradation as the heterogeneity of local data across clients increases. To resolve this issue, we propose a simple framework, Mean Augmented Federated Learning (MAFL), where clients send and receive averaged local data, subject to the privacy requirements of target applications. Under our framework, we propose a new augmentation algorithm, named FedMix, which is inspired by a phenomenal yet simple data augmentation method, Mixup, but does not require local raw data to be directly shared among devices. Our method shows greatly improved performance in the standard benchmark datasets of FL, under highly non-iid federated settings, compared to conventional algorithms.

* ICLR 2021

Via

Access Paper or Ask Questions